METHODS AND COMPOUNDS FOR MODULATING NUCLEAR RECEPTOR ACTIVITY
this invention.

INTRODUCTION
Technical Field

The present invention relates to improved methods and compounds for modulating 
nuclear receptor activity. In particular, the present invention relates to improved methods and 
compounds for modulating estrogen receptor activity. 

Background 

Cells contain receptors that can elicit a biological response by binding various molecules
including proteins, hormones and/or drugs. Nuclear receptors represent a superfamily of
proteins that are hormone/ligand-activated transcription factors that enhance or repress
transcription in a cell type-, ligand- and promoter-dependent manner. A classic nuclear receptor,
the estrogen receptor α (ERα), is a key factor in regulating the differentiation and maintenance of
neural, skeletal, cardiovascular and reproductive tissues (Korach, Science 266:1524-1527
(1994); Smith, et al., New Engl. J. Med. 331:1056-1061 (1994)). The nuclear receptor family
also includes receptors for glucocorticoids, androgens, mineralocorticoids, progestins, thyroid
hormones, vitamin D, retinoids, peroxisome proliferators and eicosanoids. The steroid
receptors, which include the estrogen, glucocorticoid and progestin receptors, form a subset of
the nuclear receptor family.

Overall sequence conservation between nuclear receptors varies between different
families of receptors; however, sequence conservation between functional regions, or modules,
of the receptors is high. For example, nuclear receptors can be organized into functional
modules comprising an N-terminal transcriptional activation domain, a central DNA binding
domain (DBD), and a C-terminal ligand binding domain (LBD). The LBD of nuclear receptors
represents a hormone/ligand-dependent molecular switch, and recognizes a variety of
compounds diverse in their size, shape and chemical properties. Accordingly, the estrogen
hormones exert their physiological effects by binding to the estrogen receptor (Beato, et al., Cell
83(6):851-857 (1995); Tsai, et al., Annu. Rev. Biochem. 63:451-86 (1994)). Binding of a
hormone to a nuclear receptor's LBD also changes its ability to modulate transcription of DNA,
although they may have transcription-independent actions.
All ERα ligands bind exclusively to the C-terminal LBD. Some of these ligands,
including the endogenous estrogen, 17β-estradiol (E2), and the synthetic nonsteroidal estrogen,
diethylstilbestrol (DES), function as pure agonists whereas others such as ICI-164,384 function
as pure antagonists. Synthetic ligands such as tamoxifen and raloxifene (RAL) belong to a
growing class of molecules known as selective estrogen receptor modulators (SERMs), which
function as antagonists in specific tissue and promoter contexts (Grese, et al., Proc. Natl. Acad.
Sci. USA 94:14105-10 (1997)). The remarkable tissue-specific behavior of tamoxifen was
recently demonstrated in a breast cancer prevention trial, reported in Smigel, J. Natl. Cancer Inst.
90:647-8 (1998), in which a group of women at high risk for breast cancer who received tamoxifen
treatment over a six year period exhibited an increased incidence of endometrial cancer but a
reduced occurrence of certain bone fractures and a dramatic 45% reduction in breast cancer
incidence. The rational design of new SERMs and the optimization of existing ones require an
understanding of the effects of different ligand chemistries and structures upon ERα
transcriptional activity.

Nuclear receptors also bind proteins, such as chaperone complexes, corepressors, or 
coactivators, that are involved in receptor function. In particular, ligand-dependent activation of 
transcription by nuclear receptors is mediated by interactions with coactivators. Receptor
agonists promote coactivator binding and antagonists block coactivator binding. Hormone 
binding by a nuclear receptor can increase or decrease binding affinity to these proteins, and can 
influence or mediate the multiple actions of the nuclear receptors on transcription. 

Transcriptional activation by ERα is mediated by at least two separate activation
functions (AFs) located within different domains of the protein, AF-1 in the N-terminus, and
AF-2 in the LBD. These AFs can act independently or cooperatively, depending on the cell type and
the promoter context. The activity of AF-1 is regulated by growth factors acting through the
MAP kinase pathway (Kato, et al., Science 270:1491-1494 (1995)) and is generally believed to
be activated in a ligand-independent manner, while AF-2 activity ("transcriptional activity") is
responsive to ligand binding (Kumar, et al., Cell 51(6):941-951 (1987)). The binding of agonists
triggers transcriptional activity whereas the binding of antagonists does not (Berry, et al., EMBO
J. 9:2811-8 (1990)). In addition, coactivators mediate transcriptional activity. The structural and
functional nature of the site to which coactivators bind has only recently been defined (Apriletti,
et al., US Provisional No. 60/079,956, filed March 30, 1998, the disclosure of which is
incorporated herein by reference).

WO 99/50658 PCT/US99/06937

Recent structural studies suggest that ligands regulate transcriptional activity by directly
affecting the structure of the LBD. Comparison of the structure of the unliganded human
retinoid X receptor α LBD (Bourguet, et al., Nature 375:377-82 (1995)) with the structures of
the liganded LBDs of the human retinoic acid receptor γ (RARγ) (Renaud, et al., Nature
378:681-689 (1995) and Wurtz, et al., Nat. Struct. Biol. 3:87-94 (1996)), the thyroid hormone
receptor α (TRα) (Wagner, et al., Nature 378:690-697 (1995)), the progesterone receptor
(Williams, et al., Nature 393:392-395 (1998)) and the ERα (Brzozowski, et al., Nature
389:753-758 (1997); Tanenbaum, et al., Proc. Natl. Acad. Sci. USA 95:5998-6003 (1998)) suggests that
an agonist-induced conformational change involving the repositioning of helix 12, the most
C-terminal helix of the LBD, is essential for transcriptional activity. Because certain point
mutations in helices 3, 5 and 12 abolish transcriptional activity but have no effect on ligand or
DNA binding, these regions of the LBD have been predicted to form part of a recognition
surface, created in the presence of agonist, for molecules that link the receptor to the general
transcriptional machinery (Danielian, et al., EMBO J. 11:1025-33 (1992); Feng, et al., Science
280:1747-9 (1998); Henttu, et al., Mol. Cell. Biol. 17:1832-9 (1997); Wrenn, et al., J. Biol.
Chem. 268:24089-24098 (1993)). The structures of the LBD complexed with E2 and RAL show
that although both ligands bind at the same site within the core of the LBD (Brzozowski, et al.,
supra), each of these ligands induces a different conformation of helix 12. Whereas helix 12 in
the E2-LBD complex packs against helices 3, 5/6 and 11 in a conformation that has been
observed for the corresponding helix in other agonist-bound nuclear receptor (NR) LBD
structures, helix 12 in the RAL-LBD complex is bound in a hydrophobic groove composed of
residues from helices 3 and 5. This alternative orientation of helix 12 partially buries residues
in the groove that are
necessary for transcriptional activity, suggesting that RAL and possibly other antagonists block
transcriptional activity by disrupting the topography of the coactivator binding site surface.

Biochemical and genetic approaches have led to the identification of several proteins that
associate in a ligand-dependent manner with ERα (Horwitz, et al., Mol. Endocrinol.
10:1167-1177 (1996)) including SRC-1/N-CoA1 (Onate, et al., Science 270:1354-1357 (1995)),
GRIP1/TIF2/SRC-2 (Hong, et al., Proc. Natl. Acad. Sci. USA 93(10):4948-4952 (1996) and
Voegel, et al., EMBO J. 15:3667-3675 (1996)), p/CIP/RAC3/ACTR/AIB1/SRC-3 (Anzick, et al.,
Science 277:965-968 (1997), Chen, et al., Cell 90(3):569-80 (1997), Li, et al., Proc. Natl. Acad.
Sci. USA 94:8479-84 (1997) and Torchia, et al., Nature 387:677-684 (1997)) and CBP/p300
(Hanstein, et al., Proc. Natl. Acad. Sci. USA 93:11540-11545 (1996)). These proteins have been
classified as transcriptional coactivators because they enhance ligand-dependent transcriptional
activation by ERα as well as by several other NRs (Glass, et al., Curr. Opin. Cell Biol. 9:222-32
(1997); Torchia, et al., supra). The observation of partial hormone resistance in mice with a
disrupted SRC-1 gene (Xu, et al., Science 279:1922-1925 (1998)) provides compelling evidence
that coactivators are required for NR function in vivo. Consistent with its proposed role in AF-2
directed transcriptional activation, SRC-1 possesses histone acetylase activity and the ability to
interact not only with agonist-bound receptors but also with other coactivators and several
general transcription factors (Kamei, et al., Cell 85(3):403-14 (1996); Onate, et al., supra;
Spencer, et al., Nature 389:194-8 (1997); Takeshita, et al., Endocrinology 137:3594-7 (1996)).
SRC-1 and GRIP1 also bind to the agonist-bound LBDs of both the human TRβ and human ERα
using the putative coactivator binding site (Feng, et al., supra).
Members of the p160 family of coactivators such as SRC-1, GRIP1/TIF2/SRC-2, and
p/CIP/RAC3/ACTR/AIB1/SRC-3 as well as other coactivators recognize agonist-bound NR
LBDs through a short signature sequence motif, LXXLL (SEQ ID NO:1) (where L is leucine and
X is any amino acid), known as the NR box (Ding, et al., Mol. Endocrinol. 12:302-313 (1998);
Heery, et al., Nature 387:733-736 (1997); Le Douarin, et al., EMBO J. 15:6701-15 (1996);
Torchia, et al., supra). Mutagenesis studies indicate that the affinity of coactivators for NR
LBDs is determined principally, if not exclusively, by these NR boxes (Ding, et al., supra;
Heery, et al., Nature 387:733-736 (1997); Le Douarin, et al., EMBO J. 15:6701-15 (1996);
Torchia, et al., supra). Each of the p160 coactivators contains several NR boxes. The NR boxes
within SRC-1, GRIP1 and TIF2 have been demonstrated to recognize different NRs with
different affinities (Ding, et al., supra; Kalkhoven, et al., EMBO J. 17:232-43 (1998); Voegel, et
al., EMBO J. 17:507-19 (1998)), but the reasons for these binding preferences are unknown.
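
The LXXLL definition given above (L is leucine, X is any amino acid) can be read as a simple pattern over one-letter amino acid codes. The following is a minimal sketch of such a scan, not a method described in this document; the function name and the example sequence are hypothetical and were invented purely for illustration.

```python
import re

# LXXLL in one-letter code: leucine, any two residues, then two leucines.
# A zero-width lookahead is used so that overlapping motifs are all reported.
NR_BOX = re.compile(r"(?=(L[A-Z]{2}LL))")


def find_nr_boxes(sequence):
    """Return (1-based start position, motif) for every LXXLL match."""
    seq = sequence.upper()
    return [(m.start() + 1, m.group(1)) for m in NR_BOX.finditer(seq)]


# Hypothetical sequence, invented for illustration; not a real coactivator.
example = "AAGSLTELLKKAALAALLGG"
print(find_nr_boxes(example))  # [(5, 'LTELL'), (14, 'LAALL')]
```

A scan of this kind only locates candidate NR boxes; it says nothing about which receptor a given box prefers, which the text notes is not yet understood.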

Darimont, et al., "Structure and specificity of nuclear receptor-coactivator interactions,"
Genes Dev. 12:3343-3356 (1998) describes structural studies of the complex between TRβ and
the GRIP1 NR Box II peptide and biochemical studies of GRIP1 binding to TRβ and GR. The
PPARγ/SRC-1 peptide complex is described in Nolte, et al., Nature 395:137-143 (1998).

The medical importance of nuclear receptors is significant. They have been implicated in
breast cancer, prostate cancer, cardiac arrhythmia, infertility, osteoporosis, hyperthyroidism,
hypercholesterolemia, obesity and other conditions. For example, compounds that modulate
ERα transcriptional activity are currently being used to treat osteoporosis, cardiovascular disease
and breast cancer (Gradishar, et al., J. Clin. Oncol. 15:840-52 (1997) and Jordan, J. Natl. Cancer
Inst. 90:967-71 (1998)).

A need continues to exist for further identification and characterization of the key
residues within the ligand binding domains of the nuclear receptors, and of molecules that affect the
receptor by binding to these sites. Understanding these interactions provides a basis for iterative
drug design, synthesis, and selection. It also would be advantageous to devise methods and
compositions for reducing the time required to discover compounds that target these binding
sites and administer them to organisms to modulate physiological processes regulated by the
nuclear receptors, and the estrogen receptor in particular.

SUMMARY OF THE INVENTION 
The present invention relates to the further identification and manipulation of the ligand
binding domain (LBD) of nuclear receptors, which facilitates the design of compounds that bind
to the LBD and modulate nuclear receptor activity, and estrogen receptor activity in particular. The
compounds include agonists and antagonists that modulate nuclear receptor activity, and can be
receptor-, cell- and/or tissue-specific. In particular, the compounds modulate nuclear receptor
activity by affecting interactions between coactivators and the coactivator binding site.

The present invention also includes protein cocrystals of the nuclear receptors with an
agonist bound to the LBD and a peptide bound to the coactivator binding site and methods for
making them. Similarly, the invention also includes protein cocrystals of the nuclear receptors
with an antagonist bound to the LBD and methods for making them. The cocrystals provide
means to obtain atomic modeling information of the specific amino acids and their atoms
that form the LBD and coactivator binding sites and that interact with molecules that bind to the
sites. The cocrystals also provide modeling information regarding the ligand:nuclear receptor
and coactivator:nuclear receptor interactions, as well as the structure of ligands bound thereto.

The present invention further provides methods for identifying and designing molecules
that modulate ligand binding to a nuclear receptor using atomic models of nuclear receptors.
The method involves modeling test compounds that fit spatially into a nuclear receptor LBD
using an atomic structural model comprising a nuclear receptor LBD or portion thereof,
screening the test compounds in an assay, such as a biological assay, characterized by binding of
a test compound to the nuclear receptor LBD, and identifying a test compound that modulates
ligand binding to the receptor.

The invention also includes compositions and methods for identifying key residues
within the LBDs of nuclear receptors. The methods involve examining the surface of a nuclear
receptor of interest to identify residues that modulate ligand and/or coactivator binding. The
residues can be identified by homology to the key residues within the LBD of human ERα
described herein. Overlays and superpositioning with a three dimensional model of a nuclear
receptor's LBD, and/or a portion thereof, also can be used for this purpose. Additionally,
alignment and/or modeling can be used as a guide for the placement of mutations on the LBD
surface to characterize the nature of the site in the context of a cell.

Also provided is a method of modulating the activity of a nuclear receptor. The method
can be in vitro or in vivo. The method comprises administering in vitro or in vivo a sufficient
amount of a compound that binds to the ligand binding domain and acts either as an agonist or an
antagonist. Preferred compounds bind to the site with greater affinity than ligands found in a cell
of interest.

The invention further includes a method for identifying an agonist or antagonist of ligand 
binding to a nuclear receptor. The method comprises providing the atomic coordinates 
comprising a nuclear receptor ligand binding domain or portion thereof to a computerized 
modeling system; modeling compounds which fit spatially into the nuclear receptor ligand
binding domain; and identifying in an assay, for example a biological assay, for nuclear receptor
activity a compound that increases or decreases activity of the nuclear receptor through binding 
the ligand binding domain. 

Also provided is a machine-readable data storage medium with information for 
constructing and manipulating an atomic model comprising the ligand binding domain or
portions thereof. The medium comprises a data storage material encoded with machine readable
data which, when using a machine programmed with instructions for using said data, is capable 
of displaying a graphical three-dimensional representation of a molecule or molecular complex 
for a nuclear receptor ligand binding domain. 

Also provided is a method of identifying a compound that selectively modulates the
activity of one type of nuclear receptor compared to other nuclear receptors. The method is
exemplified by modeling test compounds that fit spatially and preferentially into a nuclear
receptor ligand binding domain of interest using an atomic structural model of a nuclear receptor
LBD, selecting a compound that interacts with one or more residues of the LBD unique in the
context of that site, and identifying in an assay, for example a biological assay, for ligand
binding activity a compound that selectively binds to the LBD compared to other nuclear
receptors. The unique features involved in receptor-selective ligand binding can be identified by
comparing atomic models of different nuclear receptors or isoforms of the same type of receptor.

The invention finds use in the selection and characterization of peptide, peptidomimetic,
as well as other compounds, such as small organic molecules, identified by the methods of the
invention, particularly new lead compounds useful in treating nuclear receptor-based disorders,
in particular steroid receptor-based disorders, and more specifically estrogen receptor-based
disorders.

BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 provides a stereo view of the electron density of the complexes, where Figure
1A is a stereo view of the electron density of the DES-ERα LBD-GRIP1 NR Box II peptide
complex and Figure 1B is a stereo view of the electron density of the OHT-ERα LBD complex.
Figure 1 is a black and white graphical representation of a figure that was generated using
BOBSCRIPT (Esnouf, J. Mol. Graph. Model. 15:132-4, 112-3 (1997)) and rendered using
Raster3D (Merritt, et al., Acta Crystallogr. D 50:869-873 (1994)).

Figure 2 was generated using BOBSCRIPT and rendered using Raster3D as described 
above. Figure 2A shows the overall structure of the DES-ERα LBD-GRIP1 NR Box II peptide
complex in two orthogonal views. Figure 2B shows the overall structure of the OHT-ERα LBD
complex in two orthogonal views similar to those of the agonist complex in Figure 2A. 

Figures 3A and 3B were generated using BOBSCRIPT and rendered using Raster3D as
described above. Figures 3C and 3D were created using GRASP (Nicholls, GRASP Manual
(New York: Columbia University, 1992)). Figure 3A shows a close-up view of the coactivator
peptide bound to the LBD, i.e., the NR Box II peptide/LBD interface. The regions of the LBD
that do not interact with the peptide have been omitted for clarity. Helices 3, 4 and 5 are labeled
H3, H4 and H5 respectively. The side chains of receptor residues which interact with the peptide
are depicted. Except for Lys 362 (blue) and Glu 542 (red), the side chains are colored by atom
type (carbon and sulfur atoms are colored green, oxygen atoms are colored red and nitrogen
atoms are colored blue). Helix 12 is colored magenta. The peptide, colored gold, is depicted as
a Cα worm; only the side chains of Ile 689 and the three motif leucines (Leu 690, Leu 693 and
Leu 694) are drawn (Figure 3C). Figure 3B shows the helix 12/LBD interface as a close-up view
of the OHT-LBD complex showing helix 12 bound to part of the coactivator binding site. Only
the side chains of residues that interact with helix 12 are drawn (with the exception of the side chain
of His 373, which is omitted for clarity). Except for Lys 362 (blue) and Glu 380 (red), the side
chains are colored by atom type as specified for Figure 3A. Residues 530-551 are depicted as a
Cα worm; residues 536-544 are colored magenta. The side chains of Leu 536, Tyr 537, Leu
540, Met 543 and Leu 544 are shown. Figure 3C is a molecular surface representation showing
the electrostatic surface of the ERα LBD bound to the NR Box II peptide as positive (blue) and
negative (red) regions, as calculated in GRASP. The coactivator peptide is depicted as in Figure
3A and the view is equivalent to that in Figure 3A. The side chains of Leu 690 and Leu 694 are
bound in a hydrophobic groove and those of Ile 689 and Leu 693 rest against the edge of this
groove. Figure 3D shows the electrostatic surface of the ERα LBD complexed with OHT,
showing positive (blue) and negative (red) regions as in Figure 3C. Residues 530-551 are
depicted as in Figure 3B and the view is equivalent to that in Figure 3B. Whereas the side chains
of Leu 540 and Leu 544 are embedded in the hydrophobic groove, that of Met 543 lies along the
edge of this groove.

Figure 4 was generated using LIGPLOT (Wallace, et al., Protein Eng. 8:127-34 (1995))
and provides schematic diagrams illustrating the DES interactions with the LBD (Figure 4A) and 
OHT interactions with the ligand binding pocket (Figure 4B). Residues that interact with the 
ligands are drawn at approximately their true positions. The residues that form van der Waals 
contacts with ligand are depicted as labeled arcs with radial spokes that face towards the ligand 
atoms with which they interact. The residues that hydrogen bond to ligand are shown in ball- 
and-stick representation. Hydrogen bonds are represented as dashed cyan lines and the distance 
of each bond is given. The ligand rings and the individual ligand atoms are labeled. 

Figure 5 was generated using BOBSCRIPT and rendered using Raster3D as described 
above, and shows a comparison of helix 12 from the OHT complex and the NR Box II peptide. 
Figure 5A and Figure 5B are stereo views. The structures of the OHT-LBD complex and the 
DES-LBD-NR Box II peptide complex were overlapped using the Cα coordinates of residues
306-526 of the LBD. Helix 12 from the DES-LBD-coactivator peptide complex is omitted for
clarity. Residues 536-551 (helix 12 = residues 536-544) from the OHT-LBD complex are colored
magenta and the peptide is colored gold. The hydrogen bonds between the ε-amino group of Lys
362 and the backbone carbonyls of residues 543 and 544 of helix 12 are illustrated as dashed
magenta lines. The hydrogen bonds between the ε-amino group of Lys 362 and the backbone
carbonyls of residues 693 and 696 of the coactivator peptide are depicted as dashed orange lines.
The following abbreviations are used on helix 12: L540=Leu 540, M543=Met 543, and
L544=Leu 544. The following abbreviations are used on the peptide: L690=Leu 690, L693=Leu
693 and L694=Leu 694.

Figures 6A and 6D were generated using BOBSCRIPT and rendered using Raster3D as
described above. Figures 6B and 6C were created using MidasPlus (Huang, et al., J. Mol. Graph.
9:230-6, 242 (1991)). Figure 6A shows that agonists and antagonists promote different LBD
conformations, as ribbon representations of the DES complex (without the coactivator peptide),
the OHT complex and the complex such as is described in Tanenbaum, et al., supra. The
hormones are shown in space-filling representation. In each complex, helix 12 is colored
magenta and the main chain of residues 339 to 341, 421 to 423, and 527 to 530 is indicated in
red. Helices 3, 8 and 11 (H3, H8 and H11 respectively) are labeled in the DES complex. Figure
6B shows DES bound in the ligand binding cavity, as a cross-section of a space-filling model of
the LBD bound to DES (green) in which the ligand is completely embedded in the ligand binding
cavity. The A' ring of DES (A'), Phe 404 (404), Met 421 (421) and Phe 425 (425) are labeled.
The carbon atoms of the side chain of Met 421 are colored magenta, and the sulfur atom is colored
yellow. Figure 6C is a cross-section of a space-filling model of OHT (red) bound in the ligand
binding pocket. The view is equivalent to that in Figure 6B. The B ring of OHT (B), Phe 404
(404), Met 421 (421) and Phe 425 (425) are labeled. The side chain of Met 421 is colored as in
Figure 6B. The conformation of the B ring forces Met 421 to adopt a different conformation
than the one it adopts in the DES complex (compare with Figure 6B). Figure 6D provides a
comparison of the ligand binding pocket bound to DES (green) and to OHT (red). The structures
of the OHT complex and the DES complex were overlapped as in Figure 5. The A rings of both
ligands point out of the page; the B ring of OHT and the A' ring of DES point into the page. The
LBD bound to OHT is colored blue and the LBD bound to DES is colored light blue-grey. The
side chains of some of the residues whose conformations are dramatically different between the
two complexes are drawn: Met 342 (342), Met 343 (343), Phe 404 (404), Met 421 (421), Ile 424
(424), Phe 425 (425), His 524 (524), Leu 525 (525), Met 528 (528). The sulfur atom of Met 421
is colored yellow in both structures.

Figure 7 illustrates a model of antagonist action. Agonist (white triangle) binding 
stabilizes a conformation of the LBD that promotes coactivator (yellow) binding. Residues 527- 
530 (red) are part of helix 11 (blue) and the length of the interhelical loop prevents helix 12
(magenta) from binding to the static region of the surface involved in transcriptional activity.
Antagonist (white cross) side chains preclude helix 12 from being positioned over the ligand
binding pocket. Residues 527-530 (red) adopt an extended conformation as a result of
antagonist-driven structural perturbations in and around the ligand binding pocket. The length of
the loop between helices 11 and 12 allows helix 12 to bind the static region of this surface and
inhibit coactivator recognition.

Figure 8 shows alignment of amino acid sequences (single letter amino acid designations)
containing residues that form the coactivator binding sites of several nuclear receptors: human
and recombinant thyroid hormones (hTRβ and rTRα) (SEQ ID NO:5 and 6 and SEQ ID NO:7
and 8), retinoids (hRARγ and hRXRα) (SEQ ID NO:9 and 10 and SEQ ID NO:11 and 12),
peroxisome (hPPARγ) (SEQ ID NO:13 and 14), vitamin D (hVDR) (SEQ ID NO:15 and 16),
estrogen (hERα) (SEQ ID NO:17 and 18), glucocorticoid (hGR) (SEQ ID NO:19 and 20),
progestin (hPR) (SEQ ID NO:21 and 22), mineralocorticoid (hMR) (SEQ ID NO:23 and 24) and
androgen (hAR) (SEQ ID NO:25 and 26). The boxes represent residues of alpha-helix (H3, H4,
H5, H6 and H12); lower case letters "h" and "q" represent hydrophobic and polar residues,
respectively. The numbered positions at the top of the table, 280, 281, etc., correspond to hTRβ
residues.

DESCRIPTION OF SPECIFIC EMBODIMENTS
The present invention provides methods and compositions for identifying compounds 
that modulate nuclear receptor activity, in particular steroid receptor activity, and more 
particularly estrogen receptor activity. The compounds are nuclear receptor agonists or 
antagonists that bind to the ligand binding domain. Compounds that bind to the LBD also are 
provided. The compounds can be natural or synthetic. Preferred compounds are small organic 
molecules, peptides and peptidomimetics (e.g., cyclic peptides, peptide analogs, or constrained 
peptides). 

Certain residues within the ERα LBD have been identified that are of particular
importance: Met343, Leu346, Ala350, Glu353, Leu384, Leu387, Leu391, Arg394, Phe404,
Met421, Leu428, Gly521, His524, Leu525 and Met528 (see Figure 4A). Of these, some have
been found to directly or indirectly affect the positioning of helix 12: Met343, Met421, His524,
Leu525 and Met528. Interactions with these particular residues, such as occur when DES binds
to the receptor, stabilize a conformation of the LBD that promotes coactivator binding.
Modifications to a ligand that enhance binding or interaction with these residues would provide
for an improved agonist of receptor activity. Similarly, modifications to a ligand that adversely
affect the binding or interaction with these residues would provide for an improved antagonist.
In addition, it is believed that the ERα Tyr537 residue plays a role in stabilizing the
unliganded receptor so that helix 12 is free to interact with the coactivator binding site. The ER
is unique in having a tyrosine at this position, as hTRβ, rTRα, hRARγ, hGR, hPR, hMR and
hAR all have a proline residue. hRXRα has an aspartic acid residue, hPPARγ has a histidine
residue and hVDR has a threonine residue at positions corresponding to the Tyr 537 residue of
hERα. Therefore, selective agonists and antagonists can be designed for the estrogen receptor
that interact with Tyr 537.
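
The residue identities listed in the preceding paragraph can be tabulated to show which receptors share a residue at the position corresponding to Tyr 537 of hERα. The sketch below only restates the text in a lookup table; the dictionary and function names are hypothetical and are not part of the disclosed method.

```python
from collections import Counter

# Residue at the position aligned with Tyr 537 of hERα, as stated in the text.
RESIDUE_AT_ER_537 = {
    "hERα": "Tyr",
    "hTRβ": "Pro", "rTRα": "Pro", "hRARγ": "Pro",
    "hGR": "Pro", "hPR": "Pro", "hMR": "Pro", "hAR": "Pro",
    "hRXRα": "Asp",
    "hPPARγ": "His",
    "hVDR": "Thr",
}


def receptors_sharing(residue, table=RESIDUE_AT_ER_537):
    """Return the receptors that carry the given residue at this position."""
    return sorted(name for name, res in table.items() if res == residue)


def residue_counts(table=RESIDUE_AT_ER_537):
    """Count how many listed receptors carry each residue at this position."""
    return Counter(table.values())


print(receptors_sharing("Tyr"))  # ['hERα']
print(residue_counts()["Pro"])   # 7
```

Among the receptors listed, only hERα carries a tyrosine here, which is the basis for the selectivity argument made above.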

Traditional antagonists such as OHT or RAL have side chains that directly block helix 12
from being positioned as it is in the agonist complexes (DES or E2). In addition, it is expected that
ligand-induced perturbations are required to allow helix 12 to reach into the coactivator helix binding
site, thereby blocking coactivator binding and inhibiting transcriptional activity. Such
perturbations, or interference with proper interactions with one or more of these LBD residues,
have been found to relax the receptor, disassembling the receptor's secondary structure and
resulting in unwinding of helix 11. The unwinding of helix 11 increases the length of the loop
between helices 11 and 12, allowing helix 12 to move away from the ligand binding pocket and
towards the coactivator binding site, where it occludes the coactivator recognition groove by
mimicking the interactions of the coactivator, and thus inhibits coactivator recognition (see
Figure 7). Modifications to a ligand that interfere with binding or interaction with one or more
of the amino acid positions indicated would cause receptor relaxation, affecting the receptor's
secondary structure and causing the unwinding of helix 12. Compounds based upon such modified
ligands would act as antagonists.

Accordingly, one aspect of the invention is a method of identifying a compound that
modulates (i.e., increases or decreases) nuclear receptor activity, comprising: modeling test
compounds that fit spatially into a nuclear receptor ligand binding domain of interest using an
atomic structural model of the estrogen receptor α ligand binding domain or portion thereof,
screening the test compounds in an assay, for example a biological assay, characterized by
binding of a test compound to the ligand binding domain, and identifying a test compound that
modulates nuclear receptor activity, wherein the atomic structural model comprises atomic
coordinates of amino acid residues corresponding to residues of human estrogen receptor α
Met343, Leu346, Ala350, Glu353, Leu384, Leu387, Leu391, Arg394, Phe404, Met421, Leu428,
Gly521, His524, Leu525 and Met528, preferably Met343, Met421, His524, Leu525 and Met528.
In a preferred embodiment, the nuclear receptor is the ER. The test compound can be an agonist
and nuclear receptor activity is measured by binding of a coactivator or a compound that mimics
a coactivator, to the coactivator binding site, as defined below. On the other hand, the test
compound can be an antagonist and nuclear receptor activity is measured by the unwinding of
helix 12 and/or the blocking of coactivator binding to the coactivator binding site. The screening
is typically in vitro, and high throughput screening is preferable. Suitable test compounds can be
designed, as is described later, or can be obtained from a library of compounds, and include, by
means of illustration and not limitation, small organic molecules, peptides and peptidomimetics.
The method described above may also include the step of providing the atomic coordinates of the
estrogen receptor α ligand binding domain or portion thereof to a computerized modeling
system, prior to said modeling step.

As used herein, the term "portion thereof" is intended to mean the atomic coordinates
corresponding to a sufficient number of residues or their atoms of the LBD that interact with a
compound capable of binding to the site. This includes receptor residues having an atom within
4.5 Å of a bound compound or fragment thereof. Thus, for example, the atomic coordinates
provided to the modeling system can contain atoms of the nuclear receptor LBD, part of the LBD
such as atoms corresponding to the LBD or a subset of atoms useful in the modeling and design
of compounds that bind to a LBD.
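
The 4.5 Å criterion above can be illustrated with a minimal sketch. It assumes atoms are supplied as plain coordinate tuples; the function name, the data layout and the toy coordinates are hypothetical and are not taken from the Examples or from any deposited structure.

```python
import math

CUTOFF_ANGSTROM = 4.5  # distance criterion stated in the text


def residues_near_ligand(receptor_atoms, ligand_atoms, cutoff=CUTOFF_ANGSTROM):
    """Return sorted labels of residues with at least one atom within `cutoff` of a ligand atom.

    receptor_atoms: iterable of (residue_label, (x, y, z)) pairs, one per atom
    ligand_atoms:   iterable of (x, y, z) tuples
    """
    ligand_atoms = list(ligand_atoms)
    selected = set()
    for label, xyz in receptor_atoms:
        if label in selected:
            continue  # residue already qualifies through another atom
        if any(math.dist(xyz, lig) <= cutoff for lig in ligand_atoms):
            selected.add(label)
    return sorted(selected)


# Toy coordinates (hypothetical, for illustration only).
receptor = [
    ("His524", (0.0, 0.0, 0.0)),
    ("His524", (1.5, 0.0, 0.0)),
    ("Leu525", (3.0, 3.0, 0.0)),
    ("Asp538", (20.0, 0.0, 0.0)),
]
ligand = [(0.0, 3.0, 0.0), (0.0, 4.0, 0.0)]

print(residues_near_ligand(receptor, ligand))
```

In this toy input the two residues placed 3 Å from the ligand are retained and the distant one is dropped; a real "portion thereof" would be derived the same way from the full set of LBD and ligand coordinates.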

The atomic coordinates of a compound that fits into the ligand binding domain also can
be used for modeling to identify compounds or fragments that bind the site. By "modeling" is
intended quantitative and qualitative analysis of molecular structure/function based on atomic
structural information and receptor-ligand agonist/antagonist interaction models. This includes
conventional numeric-based molecular dynamic and energy minimization models, interactive
computer graphic models, modified molecular mechanics models, distance geometry and other
structure-based constraint models. Modeling is preferably performed using a computer and may
be further optimized using known methods. By "fits spatially" is intended that the
three-dimensional structure of a compound is accommodated geometrically by a cavity or pocket
of a nuclear receptor LBD.

It is expected that targeting the corresponding amino acids on other nuclear receptors will
have the same effect. Accordingly, one embodiment of the invention pertains to methods of
designing antagonists that bind the LBD of a nuclear receptor but do not interact with one or
more residues within the LBD that correspond to (i.e., the same as or equivalent to) human ERα
residues Met343, Leu346, Ala350, Glu353, Leu384, Leu387, Leu391, Arg394, Phe404, Met421,
Leu428, Gly521, His524, Leu525 and Met528, preferably Met343, Met421, His524, Leu525 and
Met528. Similarly, another embodiment of the invention pertains to methods of designing
agonists that bind the LBD of a nuclear receptor and have enhanced interaction with one or more
residues within the LBD that correspond to the human ERα residues Met343, Leu346, Ala350,
Glu353, Leu384, Leu387, Leu391, Arg394, Phe404, Met421, Leu428, Gly521, His524, Leu525
and Met528, preferably Met343, Met421, His524, Leu525 and Met528. An example of
enhanced interaction is where the agonist has a greater binding affinity for one or more of said
residues, as compared to an endogenous ligand. Such corresponding positions for other
members of the nuclear receptor family are shown in Table 1, which provides an alignment of
amino acid sequences (one letter amino acid designations) containing residues from the ligand
binding domains of several nuclear receptors that correspond to the designated positions on the
human estrogen receptor (hERα): recombinant thyroid hormone (rTRα), retinoids (hRARγ and
hRXRα), glucocorticoid (hGRα), progestin (hPR), mineralocorticoid (hMR) and androgen
(hARα). It is understood that Table 1 is merely illustrative of the invention and is not intended
to be limiting in any manner. Accordingly, it is understood that corresponding amino acid
residues of other nuclear receptors such as other estrogen receptors, thyroid receptors, retinoid
receptors, glucocorticoid receptors, progestin receptors, mineralocorticoid receptors, androgen
receptors, peroxisome receptors and vitamin D receptors, may also be used in the methods of the
invention.



Table 1

hERα     M343    M421    F425    H524    L525    M528
hPR      L715    F794    I798    Y890    C891    T894
hARα     L701    M780    Q784    F876    T877    L881
hGRα     M560    M639    Q643    Y735    C736    T740
hMR      L766    M845    L849    F941    C942    T946
hRARγ    W227    F304    L308    R396    A397    L401
hRXRα    V265    G343    F346    H435    L436    F440
rTRα     F215    L292    V296    R384    F385    M389
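
As a hedged illustration of how the correspondences in Table 1 might be used programmatically, the sketch below stores the table as a lookup keyed by receptor. The names `TABLE_1`, `HERA_POSITIONS` and `corresponding_residue` are hypothetical, and the hPR entry printed as "1798" is read here as I798 on the assumption that the leading character is a misprinted letter I.

```python
# Column order follows the hERα reference positions given in Table 1.
HERA_POSITIONS = ("M343", "M421", "F425", "H524", "L525", "M528")

# One row per receptor, values in the same column order as HERA_POSITIONS.
TABLE_1 = {
    "hERα":  ("M343", "M421", "F425", "H524", "L525", "M528"),
    "hPR":   ("L715", "F794", "I798", "Y890", "C891", "T894"),  # I798: assumed reading
    "hARα":  ("L701", "M780", "Q784", "F876", "T877", "L881"),
    "hGRα":  ("M560", "M639", "Q643", "Y735", "C736", "T740"),
    "hMR":   ("L766", "M845", "L849", "F941", "C942", "T946"),
    "hRARγ": ("W227", "F304", "L308", "R396", "A397", "L401"),
    "hRXRα": ("V265", "G343", "F346", "H435", "L436", "F440"),
    "rTRα":  ("F215", "L292", "V296", "R384", "F385", "M389"),
}


def corresponding_residue(receptor, hera_position):
    """Return the residue of `receptor` aligned with the given hERα position."""
    column = HERA_POSITIONS.index(hera_position)
    return TABLE_1[receptor][column]


print(corresponding_residue("hGRα", "H524"))
```

A lookup of this kind lets a design made against a hERα position be transferred to the aligned position of another receptor listed in the table.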

The term "coactivator binding site" is used herein to mean a structural segment or
segments of the nuclear receptor polypeptide chain folded in such a way so as to give the proper
geometry and amino acid residue conformation for binding coactivator. This is the physical
arrangement of protein atoms in three-dimensional space forming a coactivator binding site
pocket or cavity. As described by Apriletti, et al., supra, residues forming the coactivator
binding site on nuclear receptors are amino acids that correspond to (i.e., the same as or
equivalent to) human TR residues of C-terminal helix 3 (Ile280, Thr281, Val283, Val284,
Ala287, and Lys288), helix 4 (Phe293), helix 5 (Gln301, Ile302, Leu305, Lys306), helix 6
(Cys309), and helix 12 (Pro453, Leu454, Glu457, Val458 and Phe459), as shown in Figure 8.
The coactivator binding site is highly conserved among the nuclear receptor super family. Thus,
this site corresponds to a surprisingly small cluster of residues on the surface of the LBD that
form a prominent hydrophobic cleft. The hydrophobic cleft is formed by hydrophobic residues
corresponding to human TR residues of C-terminal helix 3 (Ile280, Val283, Val284, and
Ala287), helix 4 (Phe293), helix 5 (Ile302 and Leu305), helix 6 (Cys309), and helix 12 (Leu454,
Val458 and Phe459). This hydrophobic cleft of the coactivator binding site is also highly
conserved among the nuclear receptor super family.

Based upon the Examples set forth herein, residues forming the coactivator binding site
on the estrogen receptor were found to correspond to those positions described above for the
human TR. Accordingly, the residues forming the coactivator binding site on ERα are the
human ERα residues of C-terminal helix 3 (Leu354, Val355, Met357, Ile358, Ala361, and
Lys362), helix 4 (Phe367), helix 5 (Gln375, Val376, Leu379, Glu380), helix 6 (Trp383), and
helix 12 (Asp538, Leu539, Glu542, Met543 and Leu544), as shown in Figure 8. As noted above
for the nuclear receptor family in general, this site corresponds to residues on the surface of the
LBD that form a prominent hydrophobic cleft, formed by hydrophobic residues corresponding to
human ERα residues of C-terminal helix 3 (Leu354, Met357, Ile358 and Ala361), helix 4
(Phe367), helix 5 (Val376, Leu379), helix 6 (Trp383), and helix 12 (Leu539, Met543 and
Leu544). This corroborates the data presented by Apriletti, et al., supra, for the nuclear receptor
family.
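
The relationship between the two ERα residue lists above can be checked with a short sketch. The set names are hypothetical; the residues are those listed in the preceding paragraph, and the sketch only shows that the hydrophobic cleft is a subset of the full coactivator binding site and which site residues fall outside the listed cleft.

```python
# Residues of the hERα coactivator binding site, as listed in the text.
ER_COACTIVATOR_SITE = {
    "Leu354", "Val355", "Met357", "Ile358", "Ala361", "Lys362",  # helix 3
    "Phe367",                                                    # helix 4
    "Gln375", "Val376", "Leu379", "Glu380",                      # helix 5
    "Trp383",                                                    # helix 6
    "Asp538", "Leu539", "Glu542", "Met543", "Leu544",            # helix 12
}

# Residues listed in the text as forming the hydrophobic cleft.
ER_HYDROPHOBIC_CLEFT = {
    "Leu354", "Met357", "Ile358", "Ala361",  # helix 3
    "Phe367",                                # helix 4
    "Val376", "Leu379",                      # helix 5
    "Trp383",                                # helix 6
    "Leu539", "Met543", "Leu544",            # helix 12
}

# Site residues not listed as part of the cleft, ordered by residue number.
site_only = sorted(ER_COACTIVATOR_SITE - ER_HYDROPHOBIC_CLEFT,
                   key=lambda residue: int(residue[3:]))

print(ER_HYDROPHOBIC_CLEFT <= ER_COACTIVATOR_SITE)
print(site_only)
```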

Structural analysis has revealed the mechanisms by which tamoxifen and other SERMs
bind the ligand binding domain and block coactivator binding and hence transcriptional activity.
By this, an understanding of ligand and coactivator binding has also been achieved. Therefore,
the coactivator binding site residues described above are useful in designing coactivator mimics
that have broad application in the methods of the instant invention. Such "coactivator mimics"
are peptides or polypeptides that mimic the coactivator binding site recognition area on the
surface of a coactivator such that a "coactivator mimic" acts as a competitive inhibitor of
coactivator binding to the coactivator binding site. Coactivator mimics can be used in an assay
to determine receptor activity and hence the agonist or antagonist nature of a test compound, in
that an agonist will permit a coactivator mimic to bind to the coactivator binding site, while an
antagonist will prevent such binding. In addition, such coactivator mimics may have therapeutic
utility when administered in combination with an agonist compound of the invention.

Another embodiment of the invention pertains to a method of identifying a compound
that modulates ligand binding to a nuclear receptor, typically by binding to the ligand binding
domain. This method comprises the steps of modeling test compounds that fit spatially into a
nuclear receptor ligand binding domain of interest using an atomic structural model of the
estrogen receptor α ligand binding domain or portion thereof, screening the test compounds in an
assay characterized by binding of a test compound to the binding domain, and identifying a test
compound that modulates ligand binding to said nuclear receptor, wherein said atomic structural
model comprises atomic coordinates of amino acid residues corresponding to residues of human
estrogen receptor α Met343, Leu346, Ala350, Glu353, Leu384, Leu387, Leu391, Arg394,
Phe404, Met421, Leu428, Gly521, His524, Leu525 and Met528, preferably Met343, Met421,
His524, Leu525 and Met528. In the preferred method, the nuclear receptor is ER, TR, GR or
PR. The screening is typically in vitro, such as by high throughput screening. Suitable test
compounds can be designed or obtained from a library of compounds, and include, by means of
illustration and not limitation, small organic molecules, peptides and peptidomimetics. The test
compounds can be either agonists or antagonists of ligand binding.

The invention also includes compositions and methods for identifying key residues
within the ligand binding domains of nuclear receptors. The methods involve examining the
surface of a nuclear receptor of interest to identify residues that modulate ligand and/or
coactivator binding. The residues can be identified by homology to the key residues on the LBD
of human ERα described herein. A preferred method is alignment with the residues of any
nuclear receptor corresponding to (i.e., equivalent to) human ERα residues Met343, Leu346,
Ala350, Glu353, Leu384, Leu387, Leu391, Arg394, Phe404, Met421, Leu428, Gly521, His524,
Leu525 and Met528, preferably Met343, Met421, His524, Leu525 and Met528. Overlays and
superpositioning with a three-dimensional model of a nuclear receptor LBD, or a portion thereof
that contains these or corresponding residues, also can be used for this purpose. For example,
three-dimensional structures of TR, GR and PR LBDs can be used for this purpose. Nuclear
receptors identifiable by homology alignment include normal nuclear receptors
or proteins structurally related to nuclear receptors found in humans, natural mutants of nuclear
receptors found in humans, normal or mutant receptors found in animals, as well as
non-mammalian organisms such as pests or infectious organisms, or viruses.

Alignment and/or modeling also can be used as a guide for the placement of mutations on
the LBD surface to characterize the nature of the site in the context of a cell. Selected residues
are mutated to preserve global receptor structure and solubility in the case of an agonist, or to
disassemble such structure and permit helix 12 to unwind, as is the case with an antagonist.
Mutants can be tested for ligand binding as well as the relative change in strength of the binding
interaction. Ligand-dependent coactivator interaction assays, such as those described herein,
also can be used for this purpose.

In particular, the present invention relates to the structural and functional effects on the
estrogen receptor's LBD of the binding of two chemically related compounds, the agonist
diethylstilbestrol (DES) and the selective antagonist 4-hydroxytamoxifen (OHT), the active
metabolite of tamoxifen. As described in the Examples, mutagenesis and binding studies,
coupled with analysis of atomic models derived from cocrystals, reveal the structure of the
human estrogen receptor α ligand binding domain (ERα LBD) co-crystallized with a peptide
molecule comprising a GRIP1 NR Box II peptide sequence (SEQ ID NO:4) (i.e., a peptide
derived from the NR Box II region of the p160 coactivator GRIP1) bound to the coactivator
binding site and the agonist, DES. Also revealed is the structure of the ERα LBD co-crystallized
with the antagonist, OHT. The Examples provide the 2.03 Å resolution crystal structure of the
hERα LBD bound to DES and the coactivator and the 1.9 Å x-ray crystal structure of the hERα
LBD bound to OHT, i.e., the crystals diffract with at least 2.03 Å or 1.9 Å resolution, respectively.

In yet another aspect of the invention, compounds of interest are discovered, i.e., agonists
or antagonists of ligand binding are identified, by a method for identifying an agonist or
antagonist of ligand binding to a nuclear receptor. The method comprises the steps of providing
the atomic coordinates of the ERα LBD or portion thereof to a computerized modeling system,
modeling compounds which fit spatially into the LBD, and identifying in an assay for nuclear
receptor activity a compound which increases or decreases the activity of the nuclear receptor by
binding the LBD of the nuclear receptor. Preferably, the atomic coordinates are of the amino
acid residues corresponding to residues of human estrogen receptor α Met343, Leu346, Ala350,
Glu353, Leu384, Leu387, Leu391, Arg394, Phe404, Met421, Leu428, Gly521, His524, Leu525
and Met528, preferably Met343, Met421, His524, Leu525 and Met528.

Compounds of particular interest fit spatially and preferentially into the ligand binding
domain. By "fits spatially and preferentially" is intended that a compound possesses a
three-dimensional structure and conformation for selectively interacting with a nuclear receptor LBD.
Compounds that fit spatially and preferentially into the LBD interact with amino acid residues
forming the hydrophobic cleft of this site. The present invention also includes a method for
identifying a compound capable of selectively modulating nuclear receptor activity. The method
comprises the steps of modeling test compounds that fit spatially and preferentially into the LBD
of a nuclear receptor of interest using an atomic structural model of a nuclear receptor, screening
the test compounds in an assay for nuclear receptor activity characterized by preferential binding
of a test compound to the LBD of a nuclear receptor, and identifying a test compound that
selectively modulates the activity of a nuclear receptor. Such receptor-specific compounds are
selected that exploit differences between the LBDs of one type of nuclear receptor versus a
second type of nuclear receptor.

The invention also is applicable to generating new compounds that distinguish nuclear
receptor isoforms. This can facilitate generation of either tissue-specific or function-specific
compounds. For instance, GR subfamily members usually have one receptor encoded by a
single gene, although there are exceptions. For example, there are two PR isoforms, A and B,
translated from the same mRNA by alternate initiation from different AUG codons. There are
two GR forms, one of which does not bind ligand. This method is especially applicable to the
ER subfamily, which usually has several receptors that are encoded by at least two (ER: α, β)
genes or have alternate RNA splicing.

The receptor-specific compounds of the invention preferably interact with
conformationally constrained residues of the LBD that are conserved among one type of nuclear
receptor compared to a second type of nuclear receptor. "Conformationally constrained" is
intended to refer to the three-dimensional structure of a chemical or moiety thereof having
certain rotations about its bonds fixed by various local geometric and physical-chemical
constraints. Conformationally constrained structural features of a LBD include residues that
have their natural flexible conformations fixed by various geometric and physical-chemical
constraints, such as local backbone, local side chain, and topological constraints. These types of
constraints are exploited to restrict positioning of atoms involved in receptor-coactivator
recognition and binding.

The present invention also provides for a computational method using three-dimensional
models of nuclear receptors based on crystals of nuclear receptors. Generally, the computational
method of designing a nuclear receptor ligand determines which amino acid or amino acids of a
nuclear receptor LBD interact with a chemical moiety (at least one) of the ligand using a
three-dimensional model of a crystallized protein comprising a nuclear receptor LBD with a bound
ligand, and selects a chemical modification (at least one) of the chemical moiety to produce a
second chemical moiety with a structure that either decreases or increases an interaction between
the interacting amino acid and the second chemical moiety compared to the interaction between
the interacting amino acid and the chemical moiety. In the instant invention, crystal structures of
the hERα with DES/peptide and with OHT have shown that amino acid residues that correspond
to hERα Met343, Leu346, Ala350, Glu353, Leu384, Leu387, Leu391, Arg394, Phe404, Met421,
Leu428, Gly521, His524, Leu525 and Met528, preferably Met343, Met421, His524, Leu525
and/or Met528, interact with at least one chemical moiety on the ligand.

Accordingly, one embodiment of the invention is a computational method of designing a
nuclear receptor ligand where at least one amino acid residue of a nuclear receptor LBD that
corresponds to human estrogen receptor α Met343, Leu346, Ala350, Glu353, Leu384, Leu387,
Leu391, Arg394, Phe404, Met421, Leu428, Gly521, His524, Leu525 and Met528, preferably
Met343, Met421, His524, Leu525 and Met528, interacts with at least one first chemical moiety
of the ligand, comprising the step of selecting at least one chemical modification of the first
chemical moiety to produce a second chemical moiety with a structure to either decrease or
increase an interaction between the interacting amino acid and the second chemical moiety as
compared to the interaction between the interacting amino acid and the first chemical moiety.

This computational method may further comprise determining a change in interaction
between the interacting amino acid and the ligand after chemical modification of the first
chemical moiety. The chemical modification can either enhance or reduce hydrogen bonding
interaction, charge interaction, hydrophobic interaction, van der Waals interaction or dipole
interaction between the second chemical moiety and the interacting amino acid as compared to
the interaction between the first chemical moiety and the interacting amino acid.
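
One way to picture "determining a change in interaction" is a pairwise energy comparison before and after modification. The sketch below is an assumption-laden illustration, not the method of the Examples: it uses a generic Lennard-Jones 12-6 term plus a Coulomb term for a single atom pair, and every parameter value (well depth, contact distance, separations) is hypothetical.

```python
COULOMB_KCAL = 332.06  # Coulomb constant in kcal*Angstrom/(mol*e^2)


def pair_energy(r, epsilon, sigma, q1=0.0, q2=0.0, dielectric=1.0):
    """Lennard-Jones 12-6 plus Coulomb energy (kcal/mol) for one atom pair at distance r (Angstrom)."""
    ratio6 = (sigma / r) ** 6
    lennard_jones = 4.0 * epsilon * (ratio6 ** 2 - ratio6)
    coulomb = COULOMB_KCAL * q1 * q2 / (dielectric * r)
    return lennard_jones + coulomb


def interaction_change(first, second):
    """Energy of the second moiety minus energy of the first; negative means enhanced interaction."""
    return pair_energy(**second) - pair_energy(**first)


# Hypothetical parameters: the modification brings a ligand atom closer to a residue atom.
first_moiety = {"r": 5.0, "epsilon": 0.1, "sigma": 3.5}
second_moiety = {"r": 3.9, "epsilon": 0.1, "sigma": 3.5}

delta = interaction_change(first_moiety, second_moiety)
print("enhanced" if delta < 0 else "reduced")
```

A full treatment would sum such terms over all ligand and residue atom pairs and would normally rely on an established force field rather than hand-picked parameters.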

Chemical modifications will often enhance or reduce interactions between an atom of an LBD
amino acid and an atom of an LBD ligand. Steric hindrance will be a common means of
changing the interaction of the LBD binding cavity with the activation domain. Chemical
modifications are preferably introduced at C-H, C- and C-OH positions in ligands, where the
carbon is part of the ligand structure which remains the same after modification is complete. In
the case of C-H, C could have 1, 2 or 3 hydrogens, but usually only one hydrogen will be
replaced. The H or OH is removed after modification is complete and replaced with the desired
chemical moiety.

Such chemical modifications would preferably involve the addition of substituents onto
any of the free carbons of the A' ring of DES, positioning these substituents to collide or bind
preferentially with one or more residues that correspond to hERα Met343, Leu346, Ala350,
Glu353, Leu384, Leu387, Leu391, Arg394, Phe404, Met421, Leu428, Gly521, His524, Leu525
and Met528, preferably Met343, Met421, His524, Leu525 or Met528. Typical substituents are
hydrophobic groups, including by way of example and not limitation, alkyl groups such as ethyl,
propyl, isopropyl, etc., and aromatic groups such as benzyl, etc.

In practice, one would start with a known ligand for the nuclear receptor of interest as the
chemical backbone upon which to base agonist/antagonist design. The known ligand would be
modified as described above. For example, 17β-estradiol is an endogenous ligand for the hERα.
In the case of estradiol, positions of interest are C6α, C7α, C12α, C15α, C16α and C17α.
Modifications at one or more of these free carbons on 17β-estradiol's backbone would affect the
ligand's interactions with one or more of the Met343, Leu346, Ala350, Glu353, Leu384, Leu387,
Leu391, Arg394, Phe404, Met421, Leu428, Gly521, His524, Leu525 and Met528, preferably
Met343, Met421, His524, Leu525 and Met528 residues, providing for either enhanced
interaction, which would be the basis for agonist design, or reduced interaction, which would be
the basis for antagonist design.
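
The design space described above can be enumerated with a short sketch. It assumes single-site modifications only, reads the listed estradiol positions as C6α, C7α, C12α, C15α, C16α and C17α, and borrows the example hydrophobic substituents named earlier in the text; the function name is hypothetical, and the output is a list of candidates to model, not a list of compounds known to be active.

```python
from itertools import product

# Positions of interest on the 17β-estradiol backbone, as read from the text.
POSITIONS = ("C6α", "C7α", "C12α", "C15α", "C16α", "C17α")

# Example hydrophobic substituents named in the text.
SUBSTITUENTS = ("ethyl", "propyl", "isopropyl", "benzyl")


def enumerate_single_modifications(positions=POSITIONS, substituents=SUBSTITUENTS):
    """Return every (position, substituent) pair as a candidate single-site modification."""
    return list(product(positions, substituents))


candidates = enumerate_single_modifications()
print(len(candidates))
print(candidates[0])
```

Each candidate would then be modeled in the LBD to judge whether the substituent enhances or reduces interaction with the residues listed above.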

Other chemical backbones of other known ligands could be used in a similar manner.
For example, other known agonists include diethylstilbestrol (synthetic), moxestrol (synthetic),
mesohexestrol (synthetic), coumestrol (clover), Δ9-THC (cannabis), o,p-DDT (insecticide),
zearalenone (fungal) and kepone (insecticide). Known estrogen receptor antagonists include the
ICI series of modified steroids such as ICI 164,384 and EM800. Known SERMs include
tamoxifen, raloxifene and GW5638.

Alternatively, known agonists could be positioned in the ligand binding pocket through
computational or manual docking. Positions for substitution would then be selected based on the
predicted binding orientation of these compounds. In addition, hybrid molecules could be
generated that also possessed side chains that prevented helix 12 from adopting the
agonist-bound position. Novel SERMs can be produced by varying the strength of two different effects:
the helix 12 displacement and the secondary structure disorganization.

Previous efforts to make antagonists have involved attachment of large bulky substituents 
to agonists, typically through trial and error, and the drug design methods described herein 
provide an alternative strategy of ligand design that may be critical for developing new potential 
antagonists. 

For modeling, docking algorithms and computer programs that employ them can be used
to identify compounds that fit into the ligand binding domain. For example, docking programs
can be used to predict how a molecule of interest can interact with the nuclear receptor LBD.
Fragment-based docking also can be used in building molecules de novo inside the LBD, by
placing chemical fragments that complement the site to optimize intermolecular interactions.
The techniques can be used to optimize the geometry of the binding interactions. This design
approach has been made possible by identification of the LBD structure; thus, the principles of
molecular recognition can now be used to design a compound which is complementary to the
structure of this site. Compounds fitting the LBD serve as a starting point for an iterative design,
synthesis and test cycle in which new compounds are selected and optimized for desired
properties including affinity, efficacy, and selectivity. For example, the compounds can be
subjected to additional modification, such as replacement and/or addition of R-group substituents
of a core structure identified for a particular class of binding compounds, modeling and/or
activity screening if desired, and then subjected to additional rounds of testing.

Computationally, small molecule databases can be screened for chemical entities or
compounds that can bind in whole, or in part, to a nuclear receptor ligand binding domain of
interest. In this screening, the quality of fit of such entities or compounds to the binding site may
be judged either by shape complementarity (DesJarlais, et al., J. Med. Chem. 31:722-729 (1988))
or by estimated interaction energy (Meng, et al., J. Comp. Chem. 13:505-524 (1992)). The
molecule databases include any virtual or physical database, such as electronic and physical
compound library databases, and are preferably used in developing compounds that modulate
coactivator binding.
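
As a minimal sketch of the energy-based screening step, assuming each database entry has already been assigned an estimated interaction energy by some docking or scoring program, the hits can be filtered and ranked as follows. The compound names, energies and threshold are hypothetical.

```python
def rank_by_energy(scores, max_energy=0.0, top_n=None):
    """Keep compounds whose estimated energy is at or below `max_energy`, most favorable first.

    scores: mapping of compound name -> estimated interaction energy (lower is better)
    """
    hits = [(name, energy) for name, energy in scores.items() if energy <= max_energy]
    hits.sort(key=lambda item: item[1])
    return hits[:top_n]  # top_n=None keeps every hit


# Hypothetical scores, in arbitrary energy units.
scores = {"cmpd_A": -7.2, "cmpd_B": -9.5, "cmpd_C": 1.3, "cmpd_D": -8.1}

for name, energy in rank_by_energy(scores, max_energy=-8.0):
    print(name, energy)
```

Ranking by shape complementarity would follow the same pattern with a different score and sort direction.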

Compounds can be designed intelligently by exploiting available structural and
functional information by gaining an understanding of the quantitative structure-activity
relationship (QSAR), using that understanding to design new compound libraries, particularly
focused libraries having chemical diversity of one or more particular groups of a core structure,
and incorporating any structural data into that iterative design process. For example, one skilled
in the art may use one of several methods to screen chemical entities or fragments for their
ability to associate with the ligand binding domain of a nuclear receptor of interest. This process
may begin by visual inspection of, for example, the LBD on the computer screen. Selected
fragments or chemical entities may then be positioned into all or part of the site. Docking may
be accomplished using software such as Quanta and Sybyl, followed by energy minimization and
molecular dynamics with standard molecular mechanics force-fields, such as CHARMM and
AMBER.

Specialized computer programs may also assist in the process of selecting chemical entity
fragments or whole compounds. These include: GRID (Goodford, J. Med. Chem. 28:849-857
(1985), available from Oxford University, Oxford, UK); MCSS (Miranker, et al., "Proteins:
Structure, Function and Genetics" 11:29-34 (1991), available from Molecular Simulations,
Burlington, MA); AUTODOCK (Goodsell, et al., "Proteins: Structure, Function and Genetics"
8:195-202 (1990), available from Scripps Research Institute, La Jolla, CA); and DOCK (Kuntz,
et al., J. Mol. Biol. 161:269-288 (1982), available from University of California, San Francisco,
CA).

Additional commercially available computer databases for small molecular compounds
include Cambridge Structural Database and Fine Chemical Database (Rusinko, Chem. Des.
Auto. News 8:44-47 (1993)).

Once suitable chemical entities or fragments have been selected, they can be assembled
into a single compound. Assembly may be preceded by visual inspection of the relationship of
the fragments to each other on the three-dimensional image displayed on a computer screen in
relation to the structure coordinates of a nuclear receptor. This can be followed by manual
model building using software such as Quanta or Sybyl.

Useful programs to aid one of skill in the art in connecting the individual chemical
entities or fragments include: CAVEAT (Bartlett, et al., "CAVEAT: A Program to Facilitate the
Structure-Derived Design of Biologically Active Molecules", in Molecular Recognition in
Chemical and Biological Problems, Special Pub., Royal Chem. Soc. 78:182-196 (1989),
available from the University of California, Berkeley, CA); 3D Database systems such as
MACCS-3D (MDL Information Systems, San Leandro, CA, reviewed in Martin, J. Med. Chem.
35:2145-2154 (1992)); and HOOK (available from Molecular Simulations, Burlington, MA).

In addition to building a compound in a step-wise fashion, one fragment or chemical
entity at a time as described above, compounds that bind to a ligand binding domain of interest
also may be designed as a whole or de novo using either an empty LBD or optionally including
some portion(s) of a molecule known to bind to the site, such as a known ligand. These methods
include: LUDI (Bohm, J. Comp. Aid. Molec. Design 6:61-78 (1992), available from Biosym
Technologies, San Diego, CA); LEGEND (Nishibata, et al., Tetrahedron 47:8985 (1991),
available from Molecular Simulations, Burlington, MA); and Leapfrog (available from Tripos
Associates, St. Louis, MO).

Other molecular modeling techniques may also be employed in accordance with this
invention. See, for example, Cohen, et al., J. Med. Chem. 33:883-894 (1990) and Navia, et al.,
Current Opinions in Structural Biology 2:202-210 (1992). For example, where the structures of
test compounds are known, a model of the test compound may be superimposed over the model
of the structure of the invention. Numerous methods and techniques are known in the art for
performing this step, any of which may be used. See, for example, Farmer, "Drug Design"
10:119-143, Ariens, ed., Academic Press, New York (1980); U.S. Patent No. 5,331,573; U.S.
Patent No. 5,500,807; Verlinde, Structure 2:577-587 (1994); and Kuntz, et al., Science
257:1078-1082 (1992). The model building techniques and computer evaluation systems
described herein are not a limitation on the present invention.

Using these computer modeling systems, a large number of compounds may be quickly
and easily examined and expensive and lengthy biochemical testing avoided. Moreover, the
need for actual synthesis of many compounds can be substantially reduced and/or effectively
eliminated.

Compounds identified through modeling can be screened in assays such as are well
known in the art. Such assays, which include biological assays, are characterized by binding of
the compound to a ligand binding domain of interest for ligand binding activity. Screening can
be, for example, in vitro, in cell culture, and/or in vivo. Biological screening preferably centers
on activity-based response models, binding assays (which measure how well a compound binds
to the receptor), and bacterial, yeast and animal cell lines (which measure the biological effect of
a compound in a cell). The assays can be automated for high capacity-high throughput screening
(HTS) in which large numbers of compounds can be tested to identify compounds with the
desired activity.

As an example, in vitro binding assays can be performed in which compounds are tested
for their ability to block the binding of a ligand, fragment, fusion or peptide thereof, to a ligand
binding domain of interest. Cell and tissue culture assays may be performed to assess a
compound's ability to block function of cellular coactivators, such as members of the p160
family of coactivator proteins, such as SRC-1, AIB1, RAC3, p/CIP, and GRIP1 and its
homologues TIF2 and NCoA-2, and those that exhibit receptor and/or isoform-specific binding
affinity. In a preferred embodiment, compounds of the invention bind to a ligand binding
domain with greater affinity than the endogenous ligands. Tissue profiling and appropriate
animal models also can be used to select compounds. Different cell types and tissues also can be
used for these biological screening assays. Suitable assays for such screening are described
herein and in Shibata, et al., Recent Prog. Horm. Res. 52:141-164 (1997); Tagami, et al., Mol.
Cell. Biol. 17(5):2642-2648 (1997); Zhu, et al., J. Biol. Chem. 272(14):9048-9054 (1997); Lin,
et al., Mol. Cell. Biol. 17(10):6131-6138 (1997); Kakizawa, et al., J. Biol. Chem. 272(38):23799-
23804 (1997); and Chang, et al., Proc. Natl. Acad. Sci. USA 94(17):9040-9045 (1997), which
references are incorporated herein in their entirety by reference.

The compounds selected can have agonist and/or antagonistic properties. The
compounds also include those that exhibit new properties with varying mixtures of agonist and
antagonist activities, depending on the effects of altering ligand binding in the context of
different activities of nuclear receptors, either hormone-dependent or hormone-independent,
which are mediated by proteins other than coactivators, and which interact with the receptors at
locations other than the coactivator binding site. The compounds also include those which,
through their binding to receptor locations that are conformationally sensitive to hormone
binding, have allosteric effects on the receptor by stabilizing or destabilizing the hormone-bound
conformation of the receptor, or by directly inducing the same, similar, or different
conformational changes induced in the receptor by the binding of hormone.

Of particular interest is use of such compounds in a method of modulating nuclear 
receptor activity in a mammal by administering to a mammal in need thereof a sufficient amount 
of a compound that fits spatially and preferentially into a ligand binding domain of a nuclear 
receptor of interest. By "modulating" is intended increasing or decreasing activity of a nuclear 
receptor. For example, pre-clinical candidate compounds can be tested in appropriate animal 
models in order to measure efficacy, absorption, pharmacokinetics and toxicity following
standard techniques known in the art. Compounds exhibiting desired properties are then tested
in clinical trials for use in treatment of various nuclear receptor-based disorders. These include
ER-based disorders, such as postmenopausal symptoms and cancer resulting from loss of
estrogen production, and osteoporosis and cardiovascular disease stemming from traditional
estrogen replacement therapy. Others include GR-based disorders including Type II diabetes
and inflammatory conditions such as rheumatic diseases.

Although for many nuclear receptors, the goal is to discover novel synthetic agonists or
antagonists, it is important to realize the value for some nuclear receptors, especially the estrogen
receptor, of developing compounds that have the desired agonist and antagonist effects in target
tissues. Such compounds can be discovered and/or designed by the methods described herein,
then screened for tissue specificity by methods that are well known in the art. For example, there
is a great need for the improvement of existing therapies and the development of new agonists
that act like estrogen in cardiovascular and brain tissue and bone, and new antagonists that act
upon the estrogen receptor in uterine and breast tissue. Ideally, a compound will have more than
one of these traits, i.e., a compound will act as an agonist in one tissue, while acting as an
antagonist in another tissue. While the tissue-selective antagonism of SERMs such as OHT and
RAL is the result of numerous factors (Grainger, et al., Nature Medicine 2(4):381-385 (1996);
Grese, et al., supra; and Jordan, J. Natl. Cancer Inst. 90:967-71 (1998)), dissection of the
mechanisms of action of these ligands requires a comprehensive understanding of how they act
on the LBD and regulate its interactions with other cellular factors. The instant invention shows,
unexpectedly, that ligand-mediated structural perturbations in and around the ligand binding
pocket, and not simply side chain effects, contribute to receptor antagonism. Accordingly,
adjusting the balance between these two effects provides a novel strategy for the design of
improved SERMs.

With this knowledge, it is of particular interest to design therapeutic compounds that will
distort at least one amino acid residue corresponding to residues of human estrogen receptor α
Met343, Leu346, Ala350, Glu353, Leu384, Leu387, Leu391, Arg394, Phe404, Met421, Leu428,
Gly521, His524, Leu525 and Met528, preferably Met343, Met421, His524, Leu525 and Met528.
Accordingly, one aspect of the invention is a method of modulating nuclear receptor activity in a
mammal by administering to a mammal in need thereof a sufficient amount of a ligand that fits
spatially and preferentially into a ligand binding domain of a nuclear receptor of interest,
wherein the ligand is designed by a computational method where at least one amino acid residue
of a nuclear receptor LBD that corresponds to hERα Met343, Leu346, Ala350, Glu353, Leu384,
Leu387, Leu391, Arg394, Phe404, Met421, Leu428, Gly521, His524, Leu525 and Met528,
preferably Met343, Met421, His524, Leu525 and Met528, interacts with at least one first
chemical moiety of the ligand. Such a method involves selecting at least one chemical
modification of the first chemical moiety to produce a second chemical moiety with a structure
that either decreases or increases an interaction between the interacting amino acid and the
second chemical moiety as compared to the interaction between the interacting amino acid and
the first chemical moiety.

Compounds designed by this method can be either agonists or antagonists, and the method
of modulating nuclear receptor activity can comprise administering an antagonist alone, an
agonist alone, or an agonist in combination with a coactivator or a compound that mimics a
coactivator by binding to the coactivator binding site.

The coactivator can be a known coactivator. The coactivator mimic can be designed by a
computational method where at least one amino acid residue of a nuclear receptor coactivator
binding site that corresponds to hERα helix 3 residues Leu354, Val355, Met357, Ile358, Ala361
and Lys362, helix 4 residue Phe367, helix 5 residues Gln375, Val376, Leu379 and Glu380, helix
6 residue Trp383, and helix 12 residues Asp538, Leu539, Glu542, Met543 and Leu544 interacts
with at least one first chemical moiety of the coactivator mimic. The method involves selecting
at least one chemical modification of the first chemical moiety to produce a second chemical
moiety with a structure that either decreases or increases an interaction between the interacting
amino acid and the second chemical moiety as compared to the interaction between the
interacting amino acid and the first chemical moiety.

Use of an agonist in combination with a coactivator or coactivator mimic also provides a
unique strategy for delivering therapeutics that have novel tissue-specific effects. For example,
coactivator mimics can be designed to bind into the site involved in transcriptional activity only
when helix 12 is in its agonist-bound state. If such coactivator mimics are specific for this site of
a particular receptor, it is possible to selectively inhibit that receptor only in the presence of
agonist. This could lead to novel, tissue-specific antagonism based on the levels of endogenous
agonists. Agonists designed by the methods of the instant invention could be used in assays to
determine the specificity of coactivator mimics. Alternatively, the effective levels in a given
tissue could be modulated by giving known antagonists or antagonists designed by the methods
of the instant invention. The crystal structure of the LBD/DES/GRIP1 peptide complex
described herein precisely defines the binding site that would need to be targeted.

As noted above and as exemplified in the Examples, ER LBDs are co-crystallized with a
peptide molecule comprising a coactivator GRIP1 NR Box II peptide sequence (SEQ ID NO:4)
bound to the coactivator binding site and DES with the cocrystal structure refined to a resolution
of 2.03 Å and co-crystallized with OHT with the cocrystal structure refined to a resolution of
1.9 Å. Accordingly, the invention also provides for cocrystals made from nuclear receptor ligand
binding domains with a ligand bound to the ligand binding domain and a molecule bound to the
coactivator binding site. Preferably the cocrystal structure is refined to a resolution greater than
3.6 Å, i.e., having a resolution value less than 3.6 Å. More preferably the cocrystal structure is
refined to greater than 3.4 Å, 3.2 Å, 3.0 Å, 2.8 Å, 2.6 Å, 2.4 Å, 2.2 Å, even more preferably to a
resolution greater than 2.03 Å. The invention further provides for cocrystals made from nuclear
receptor ligand binding domains with a ligand bound to the ligand binding domain. Preferably
the cocrystal structure is refined to a resolution greater than 3.6 Å, i.e., having a resolution value
less than 3.6 Å. More preferably the cocrystal structure is refined to greater than 3.4 Å, 3.2 Å,
3.0 Å, 2.8 Å, 2.6 Å, 2.4 Å, 2.2 Å, 2.0 Å, even more preferably to a resolution greater than 1.9 Å.
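
The convention used above, in which a "greater" resolution means a numerically smaller value in Å, can be made explicit with a short check. The following is only an illustrative sketch: the function name and the tier list are assumptions introduced here, and the tier values are simply those recited above for the ligand/coactivator-peptide cocrystal.

```python
# Resolution tiers (in angstroms) recited above for the ligand/coactivator-peptide cocrystal.
TIERS_A = [3.6, 3.4, 3.2, 3.0, 2.8, 2.6, 2.4, 2.2, 2.03]


def tiers_met(resolution_a):
    """Return the recited tiers satisfied by a structure refined to `resolution_a`.

    "Resolution greater than X" in the text means a resolution value smaller than X,
    so a tier is satisfied when resolution_a < tier.
    """
    if resolution_a <= 0:
        raise ValueError("resolution must be a positive number of angstroms")
    return [tier for tier in TIERS_A if resolution_a < tier]


if __name__ == "__main__":
    # A hypothetical structure refined to 2.5 angstroms.
    print(tiers_met(2.5))
```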

Crystals are made from purified nuclear receptor LBDs that are usually expressed by a 
cell culture, such as E. coli. E. coli is often a preferred expression system. The thyroid receptor 
was successfully expressed in E. coli in Apriletti, et al., supra. However, it has long been
believed that a human heat shock protein was required for successful recombinant expression of 
the estrogen receptor. Therefore, it was quite unexpected to find that the estrogen receptor could 
be expressed as an active protein in E. coli. 

Preferably, different crystals (cocrystals) for the same nuclear receptor are separately 
made using different coactivator-type molecules, such as protein fragments, fusions or small
peptides. The coactivator-type molecules preferably contain NR-box sequences necessary for 
binding to the coactivator binding site, or derivatives of NR-box sequences. Other molecules 
can be used in co-crystallization, such as small organics that bind to the coactivator or hormone 
binding site(s). Heavy atom substitutions can be included in the LBD and/or a co-crystallizing 
molecule. 

After the three dimensional structure of the cocrystal is determined, the structural 
information can be used in computational methods to design synthetic compounds for the nuclear 
receptor, and further structure-activity relationships can be determined through routine testing 
using the assays described herein and known in the art.

Since nuclear receptor LBDs may crystallize in more than one crystal form, the structure
coordinates of such receptors or portions thereof, as provided in Appendices 1 and 2, are
particularly useful to solve the structure of those other crystal forms of nuclear receptors. They
may also be used to solve the structure of mutants or co-complexes of nuclear receptors having
sufficient homology.

One method that may be employed for this purpose is molecular replacement. In this
method, the unknown crystal structure may be determined using the structure coordinates of this
invention as provided in Appendices 1 and 2. The Appendix 1 coordinates for the DES-ERα
LBD-GRIP1 NR Box II peptide complex and the Appendix 2 coordinates for the OHT-ERα
LBD complex have been deposited with the Brookhaven National Laboratory Protein Data
Bank, and have been assigned Brookhaven Protein Data Bank Accession Numbers 2erd and 2ert,
respectively. This method will provide an accurate structural form for the unknown crystal more
quickly and efficiently than attempting to determine such information ab initio.

Atomic coordinate information gleaned from the crystals of the invention can be stored.
In a preferred embodiment, the information is provided in the form of a machine-readable data
storage medium. This medium contains information for constructing and/or manipulating an
atomic model of a ligand binding domain or portion thereof. For example, the machine-readable
data for the ligand binding domain comprises structure coordinates of amino acids corresponding
to hERα Met343, Leu346, Ala350, Glu353, Leu384, Leu387, Leu391, Arg394, Phe404, Met421,
Leu428, Gly521, His524, Leu525 and Met528, preferably Met343, Met421, His524, Leu525 and
Met528, or a homologue of the molecule or molecular complex comprising the site. The
homologues comprise a LBD that has a root mean square deviation from the backbone atoms of
the amino acids of not more than 1.5 Å. A preferred molecule or complex represents a compound
bound to the LBD.
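
The 1.5 Å root mean square deviation criterion for homologues can be illustrated with a short calculation. This is a minimal sketch under stated assumptions: the function names are hypothetical, the two structures are assumed to be already superimposed, and the backbone atoms are assumed to be supplied as paired lists of (x, y, z) coordinates in Å. It is not the procedure used to prepare the Appendices.

```python
import math


def backbone_rmsd(reference, candidate):
    """Root mean square deviation (in angstroms) between paired backbone atoms.

    Both arguments are lists of (x, y, z) tuples. The structures are assumed to be
    superimposed already, with atoms listed in the same order in both lists.
    """
    if not reference or len(reference) != len(candidate):
        raise ValueError("coordinate lists must be non-empty and of equal length")
    squared_sum = 0.0
    for (x1, y1, z1), (x2, y2, z2) in zip(reference, candidate):
        squared_sum += (x1 - x2) ** 2 + (y1 - y2) ** 2 + (z1 - z2) ** 2
    return math.sqrt(squared_sum / len(reference))


def within_homologue_cutoff(reference, candidate, cutoff_a=1.5):
    """True when the backbone RMSD does not exceed the cutoff (1.5 angstroms above)."""
    return backbone_rmsd(reference, candidate) <= cutoff_a
```

In practice the residues compared would be those corresponding to the hERα positions listed above, after an optimal superposition of the two LBDs.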

The machine-readable data storage medium can be used for interactive drug design and
molecular replacement studies. For example, a data storage material is encoded with a first set
of machine-readable data that can be combined with a second set of machine-readable data. For
molecular replacement, the first set of data can comprise a Fourier transform of at least a portion
of the structural coordinates of the nuclear receptor or portion thereof of interest, and the second
data set comprises an X-ray diffraction pattern of the molecule or molecular complex of interest.
Using a machine programmed with instructions for using the first and second data sets, a portion
or all of the structure coordinates corresponding to the second data can be determined.
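
The "Fourier transform of the structural coordinates" referred to above corresponds to calculated structure factors. The sketch below shows the textbook summation for a single reflection (h, k, l). It is a simplified illustration, not the software used for the work described herein: the function name is hypothetical, each atom is given a single constant scattering factor, fractional coordinates are assumed, and temperature factors and symmetry are ignored. A full molecular replacement search also involves rotation and translation searches that are not shown.

```python
import cmath


def structure_factor(atoms, h, k, l):
    """Calculated structure factor F(hkl) for a list of atoms.

    `atoms` is a list of (f, x, y, z) tuples, where f is a constant scattering
    factor and (x, y, z) are fractional coordinates in the unit cell.
    F(hkl) = sum over atoms of f * exp(2*pi*i*(h*x + k*y + l*z)).
    """
    return sum(
        f * cmath.exp(2j * cmath.pi * (h * x + k * y + l * z))
        for f, x, y, z in atoms
    )


if __name__ == "__main__":
    # Hypothetical two-atom, body-centered arrangement used purely for illustration.
    toy_cell = [(6.0, 0.0, 0.0, 0.0), (6.0, 0.5, 0.5, 0.5)]
    print(abs(structure_factor(toy_cell, 1, 1, 0)))
```

The amplitudes computed in this way from a known model are what is compared against the observed diffraction pattern of the unknown crystal.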
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Protein for crystals and assays described herein can be produced using expression and 
purification techniques described herein and known in the art. For example, high level
expression of nuclear receptor LBDs can be obtained in suitable expression hosts such as E. coli.
Expression of LBDs in E. coli, for example, includes the ERα LBD and other nuclear receptors,
including GR and PR. Yeast and other eukaryotic expression systems can be used with nuclear
receptors that bind heat shock proteins as these nuclear receptors are generally more difficult to
express in bacteria, with the exception of ER, which can be expressed in bacteria.
Representative nuclear receptors or their ligand binding domains have been cloned and
sequenced: human ER (as described in Seielstad, et al., Molecular Endocrinology 9(6):647-658
(1995), incorporated herein by reference), human GR, and human PR. The LBD for each of
these receptors has been identified.

Coactivator proteins can be expressed using techniques known in the art, particularly
members of the p160 family of coactivator proteins that have been cloned and/or expressed
previously, such as SRC-1, AIB1, RAC3, p/CIP, and GRIP1 and its homologues TIF2 and
NcoA-2. A preferred method for expression of coactivator protein is to express a fragment that
retains transcriptional activation activity using the "Song and Fields" method (also referred to as
the "yeast 2-hybrid" method) as described in publications by Hong, et al., Mol. Cell. Biol.
17:2735-44 (1997) and Proc. Natl. Acad. Sci. USA 93(10):4948-52 (1996), for GRIP1
expression, which references are incorporated herein by reference.

The proteins can be expressed alone, as fragments of the mature or full-length sequence,
or as fusions to heterologous sequences. For example, ERα can be expressed without any
portion of the DBD or amino-terminal domain. Portions of the DBD or amino-terminus can be
included if further structural information with amino acids adjacent to the LBD is desired.
Generally, for the ERα, the LBD used for crystals will be less than 320 amino acids in length.
Preferably, the ERα LBD will be at least 220 amino acids in length and most preferably at least
250 amino acids in length. For example, the LBD used for crystallization can comprise amino
acids spanning from 297 to 554 of the ERα.
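
The length guidance above can be checked against the example construct with simple arithmetic. The sketch below is illustrative only; the function names and category labels are assumptions introduced here, and the thresholds are those stated in the preceding paragraph.

```python
def construct_length(first_residue, last_residue):
    """Number of amino acids in a construct spanning the two residues inclusively."""
    if last_residue < first_residue:
        raise ValueError("last_residue must not precede first_residue")
    return last_residue - first_residue + 1


def classify_lbd_construct(first_residue, last_residue):
    """Classify an ERα LBD construct against the length guidance stated above."""
    length = construct_length(first_residue, last_residue)
    if length >= 320:
        return "longer than generally used"
    if length >= 250:
        return "most preferred"
    if length >= 220:
        return "preferred"
    return "shorter than preferred"


if __name__ == "__main__":
    # The example construct spanning residues 297 to 554 of ERα.
    print(construct_length(297, 554), classify_lbd_construct(297, 554))
```

The 297 to 554 construct is 258 residues long, which falls within the most preferred range.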

Typically the LBDs are purified to homogeneity for crystallization. Purity of LBDs can
be measured with sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE),
mass spectrometry (MS) and hydrophobic high performance liquid chromatography (HPLC).
The purified LBD for crystallization should be at least 97.5% pure, preferably at least 99.0%
pure, and more preferably at least 99.5% pure.

Initially, purification of the unliganded receptor can be obtained by conventional
techniques, such as hydrophobic interaction chromatography (HPLC), ion exchange
chromatography (HPLC), and heparin affinity chromatography. To achieve higher purification
for improved crystals of nuclear receptors, especially the estrogen receptor, the receptors can be
ligand-shift-purified using a column that separates the receptor according to charge, such as an
ion exchange or hydrophobic interaction column, and then binding the eluted receptor with a
ligand, especially an agonist. The ligand induces a change in the receptor's surface charge such
that the liganded receptor elutes at a different position than the unliganded receptor. Usually
saturating concentrations of ligand are used in the column and the protein can be preincubated
with the ligand prior to passing it over the column. The structural studies detailed herein indicate
the general applicability of this technique for obtaining super-pure nuclear receptor LBDs for
crystallization.

Purification can also be accomplished by use of a purification handle or "tag," such as
with a histidine amino acid engineered to reside on the end of the protein, such as on the
N-terminus, and then using a nickel or cobalt chelation column for purification (Janknecht, Proc.
Natl. Acad. Sci. USA 88:8972-8976 (1991), incorporated by reference). Typically purified
LBD, such as ERα LBD, is equilibrated at a saturating concentration of ligand at a temperature
that preserves the integrity of the protein. Ligand equilibration can be established between 2 and
37°C, although the receptor tends to be more stable in the 2-20°C range. Preferably crystals are
made with the hanging drop methods detailed herein. Regulated temperature control is desirable
to improve crystal stability and quality. Temperatures between 4 and 25°C are generally used
and it is often preferable to test crystallization over a range of temperatures. The crystals are
then subjected to vapor diffusion and bombarded with x-rays to obtain x-ray diffraction patterns
following standard procedures.

For co-crystallization with a ligand that binds the ligand binding domain, alone or in
conjunction with a peptide that binds to the coactivator binding site, various concentrations of
ligands and peptides containing a sequence that binds to a coactivator binding site of a nuclear
receptor of interest can be used in microcrystallization trials, and the appropriate compounds
selected for further crystallization. Ligands and peptides can be assayed for binding to the ligand
binding domain and coactivator binding sites of a nuclear receptor of interest by any number of
techniques, including those assays described herein. For crystallization trials with the ERα
LBD, the hanging drop vapor diffusion method is preferred. Conditions of pH, solvent and
solute components and concentrations and temperature can be adjusted, for instance, as
described in the Examples. In the hanging drop method, to obtain suitable crystals for x-ray
diffraction analysis, seeding of prepared drops with microcrystals of the complex can be used.
Collection of structural information can be determined by molecular replacement using the
structure of the ERα LBD determined herein. The structure is refined following standard
techniques known in the art.

There are many uses and advantages provided by the present invention. For example, the 
methods and compositions described herein are useful for identifying peptides, peptidomimetics 
or small natural or synthetic organic molecules that modulate nuclear receptor activity. The 
compounds are useful in treating nuclear receptor-based disorders. Methods and compositions of
the invention also find use in characterizing structure/function relationships of natural and 
synthetic ligands. 

The following discussion provides an understanding of the basis for the examples,
along with the results obtained and/or the conclusions reached.

As described above, many coactivators recognize agonist-bound nuclear receptor LBDs
through the sequence LXXLL (SEQ ID NO:1), where L is leucine and X is any amino acid (the
NR box). The structure of the DES-hERα LBD-GRIP1 peptide complex reveals that the
LXXLL motif (SEQ ID NO:1) forms the core of a short amphipathic α helix which is recognized
by a highly complementary hydrophobic groove on the surface of the receptor. In agreement
with the conclusions of other mutational and structural studies (Brzozowski, et al., supra and
Feng, et al., supra), it is believed that this peptide binding groove formed by residues from
helices 3, 4, 5 and 12 and the turn between helices 3 and 4 is the surface of ERα involved with
transcriptional activity, i.e., the coactivator binding site. Further, the features revealed by
structural studies of the complex between TRβ and the GRIP1 NR box II peptide, by biochemical
studies of GRIP1 binding to TRβ and GR (Darimont, et al., supra), and by study of the
PPARγ/SRC-1 peptide complex (Nolte, et al., supra) are similar to those of the ERα/GRIP1 NR
box II peptide complex described herein, suggesting that the mechanisms of NR box recognition
are conserved across the nuclear receptor family.
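
The NR box definition given above (LXXLL, where L is leucine and X is any amino acid) lends itself to a simple sequence scan. The following is a minimal sketch; the function name is hypothetical, and the peptide in the usage line is an invented test string, not SEQ ID NO:4 or any sequence from this document. The residue immediately preceding each motif is also reported.

```python
import re

# Lookahead so that overlapping LXXLL matches are all reported.
NR_BOX_PATTERN = re.compile(r"(?=(L[A-Z]{2}LL))")


def find_nr_boxes(sequence):
    """Return (1-based start, motif, preceding residue) for each LXXLL in `sequence`.

    `sequence` is a one-letter amino acid string. The preceding residue is None
    when the motif starts at the first position.
    """
    seq = sequence.upper()
    hits = []
    for match in NR_BOX_PATTERN.finditer(seq):
        index = match.start()
        preceding = seq[index - 1] if index > 0 else None
        hits.append((index + 1, match.group(1), preceding))
    return hits


if __name__ == "__main__":
    # Invented peptide used only to exercise the function.
    print(find_nr_boxes("AAKILAKLLEAA"))
```

A helix 12-like sequence of the form LXXML (SEQ ID NO:2) does not match this pattern, which is consistent with its treatment herein as an NR box-like rather than a true NR box sequence.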

Of the eleven AF-2 residues whose side chains interact with the coactivator helix (Figure
3A), only four (Lys 362, Leu 379, Gln 375 and Glu 542) are highly conserved across the nuclear
receptor family (Wurtz, et al., supra). The side chains of Gln 375 and Leu 379 are
predominantly buried even in the absence of GRIP1 binding and appear to form integral parts of
the architecture of the surface involved in transcriptional activity. In contrast, the side chains of
Lys 362 and Glu 542 are largely solvent exposed in the absence of coactivator and make both
nonpolar contacts and the only receptor-mediated polar interactions with the coactivator helix.
These two capping interaction residues are perfectly positioned at opposite ends of the
coactivator binding site groove not only to stabilize the main chain conformation of the
coactivator but also to function as a molecular caliper; the 15 Å distance between Lys 362 and
Glu 542 is well suited to measure off the ~11 Å axial length of the short, two-turn coactivator α
helix (Figure 3C). Similar receptor-mediated capping interactions have also been observed in a
complex between the TRβ LBD and the NR box II peptide (Darimont, et al., supra). Mutation of
either of these two residues severely cripples coactivator binding by ERα as well as by TRβ (see
the Examples, Apriletti, et al., supra; Feng, et al., supra; and Henttu, et al., supra). Hence the
formation of helix capping interactions may be a general feature of coactivator recognition by
nuclear receptors.
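
The ~11 Å axial length quoted for a two-turn α helix can be cross-checked from standard α-helix geometry. The helix parameters used below (about 3.6 residues per turn and a rise of about 1.5 Å per residue) are general textbook values and are assumptions of this sketch, not measurements taken from the structures described herein; the function name is likewise hypothetical.

```python
# Textbook alpha-helix parameters (assumed, not taken from this document).
RESIDUES_PER_TURN = 3.6
RISE_PER_RESIDUE_A = 1.5

# Distance between Lys 362 and Glu 542 stated in the text above.
CALIPER_DISTANCE_A = 15.0


def helix_axial_length(turns):
    """Approximate axial length in angstroms of an ideal alpha helix of `turns` turns."""
    return turns * RESIDUES_PER_TURN * RISE_PER_RESIDUE_A


if __name__ == "__main__":
    length = helix_axial_length(2)
    print(round(length, 1), length < CALIPER_DISTANCE_A)
```

An ideal two-turn helix comes to roughly 10.8 Å, in line with the ~11 Å figure and shorter than the 15 Å separation of the two capping residues.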

The side chains of six hydrophobic AF-2 residues (Ile 358, Leu 372, Val 376, Leu 379,
Leu 539 and Met 543) form a large part of the highly cooperative network of van der Waals
contacts made by the receptor with the hydrophobic face of the coactivator helix (Figures 3A and
3C). Mutations in Ile 358, Val 376 and Leu 539 abrogate GRIP1 binding (see the Examples and
Feng, et al., supra). Although these residues are, in general, more poorly conserved across
nuclear receptors than either Lys 362 or Glu 542, their hydrophobic character, with the exception
of Leu 372, is conserved. Since the different NR LBDs adopt the same overall fold (Moras, et
al., Curr. Opin. Cell Biol. 10:384-91 (1998)), it follows that the hydrophobic regions of different
nuclear receptor coactivator binding site surfaces are distinctly textured. In support of this
hypothesis, the NR box II peptide used in crystallization inhibited binding of GRIP1 to the LBDs
of the ERα, the TRβ and the glucocorticoid receptor (GR) with very different efficiencies (Ding,
et al., supra).

The hydrophobic face of the NR box helix is formed by the side chains of the three motif
leucines and the isoleucine preceding the motif (Ile 689). The functional importance of the
conserved leucines in receptor binding has been demonstrated by numerous in vitro and in vivo
studies. In contrast, the role of the residue preceding the motif in receptor binding has been
poorly characterized. Both biochemical and structural data implicate Ile 689 as a key receptor
binding determinant. In the crystal, only the side chains of the motif leucines and Ile 689
extensively contact the LBD in both noncrystallographic symmetry-related peptides. Mutation of
Ile 689 to alanine reduces the ability of the peptide to inhibit the binding of GRIP1 to ERα by
~30-fold in a competition assay (data not shown). The side chain of Ile 689 lies in a rather
chemically distinct environment; this residue forms van der Waals contacts with the aliphatic
portion of the Asp 538 side chain, the side chain of Leu 539 and the γ-carboxylate of Glu 542
(Figures 1A and 3A). The proximity of this negatively charged moiety of Glu 542 to the
hydrophobic side chain of Ile 689 should enhance the electrostatic potential of the side chain
carboxylate and strengthen its stabilizing interactions with the N-terminus of the coactivator
helix. Despite its apparently crucial role in receptor recognition, the identity of the residue
immediately preceding known NR boxes is poorly conserved. This sequence variability should
have effects not only on packing interactions with ERα but also on both the chemical
environment and the critically important orientation of Glu 542. This should in turn translate
into variations in affinity for the receptor. Indeed, the three NR boxes from GRIP1, which each
contain a different residue preceding the LXXLL motif (SEQ ID NO:1), have differing affinities
for ERα (Ding, et al., supra; Voegel, et al., EMBO J. 17:507-19 (1998)).

The NR boxes of TIF2, the human homologue of GRIP1, have been found to be partially
functionally redundant despite their individual differences in receptor binding affinities. All
three NR boxes of TIF2 must be mutated to completely eliminate interaction with the ERα
(Voegel, et al., EMBO J. 17:507-19 (1998)). Our data indicate that a single NR box peptide is
sufficient to form a tight complex with a single ERα LBD. Yet p160 coactivators possess
multiple NR boxes. A possible explanation for the presence of multiple NR boxes is that they
provide coactivators with broad specificity. Receptor binding relies upon the intricate formation
of multiple van der Waals interactions, yet the various nuclear receptors appear to have different
coactivator binding site surfaces. The different amino acids in the position immediately
preceding the LXXLL motif (SEQ ID NO:1) might allow some degree of adaptability to these
distinct surfaces; however, there may be no NR box sequence that is capable of efficiently
binding to all nuclear receptors. Multiple NR boxes may therefore provide coactivators the
diversity of interfaces necessary to recognize a variety of targets.

ERα transcriptional activity is blocked by antagonists such as OHT and RAL. The most
striking feature of the structures of the OHT and RAL liganded ERα LBDs is that helix 12 is
bound to the static region of the coactivator recognition groove (Figure 3B and Brzozowski, et
al., supra). A comparison of these two structures with the structure of the coactivator/LBD
complex reveals that in the antagonist complexes, the region of helix 12 with an NR box-like
sequence (LXXML versus LXXLL) (SEQ ID NO:2 versus SEQ ID NO:1) functions as an
intramolecular mimic of the coactivator helix (Figure 5 and Brzozowski, et al., supra).
Consistent with the proposals of others (Brzozowski, et al., supra and Darimont, et al., supra),
this disposition of helix 12 directly affects the structure and function of the surface responsible
for transcriptional activity in two ways. First, because helix 12 residues form an integral part of
the coactivator binding site surface, the surface is incomplete when helix 12 is in the
antagonist-bound conformation. In particular, Leu 539, Glu 542 and Met 543 are incorrectly
oriented for coactivator recognition. Second, residues from the static region of this surface are
bound to helix 12 and are prevented from interacting with coactivator (Figures 3A and 3B).

The sequence similarity of helix 12 of the ERα LBD to the LXXLL motif (SEQ ID
NO:1) appears to be unique among nuclear receptors. The identities of the residues in this region
of helix 12 in other nuclear receptors, although generally hydrophobic in character, do not as
closely resemble the sequence of an NR box as those of ERα (Wurtz, et al., supra). However, it
is possible that an intramolecular inhibitor with a suboptimal recognition sequence would
compete for coactivator binding given its extremely high local concentration.

The binding of OHT to the ERα promotes a helix 12 conformation that inhibits binding
of the coactivator. OHT does not directly interact with any helix 12 residues (Figure 4B).
Moreover, the structure of the LBD in the region of the coactivator binding site groove that
interacts with helix 12 in the OHT complex is the same in the DES and E2 complexes (Figures
3A, 3B and 5).

Numerous studies have demonstrated the importance of the OHT side chain in receptor
antagonism (Jordan, et al., supra and Robertson, et al., J. Med. Chem. 25:167-71 (1982)). A
comparison of the structures of the OHT and DES complexes reveals that the binding mode of
the OHT side chain precludes the agonist-induced conformation of helix 12. The OHT side
chain projects out of the ligand binding pocket between helices 3 and 11 (Figures 2B, 6B and
6C). As a result, the positioning of helix 12 over the ligand binding pocket, as it is in the
agonist-bound conformation, would bury the positively charged dimethylamino group of the
OHT side chain within a hydrophobic cavity and produce steric clashes between the
dimethylaminoethyl region of the side chain and the side chain of Leu 540.

In functional terms, OHT is not, however, simply "an agonist with a side chain". OHT
binding promotes a conformation of the LBD that is distinct from that stabilized by either DES
or E2 binding. These different conformations impose different restrictions on the positioning of
helix 12.

Helices 3, 8 and 11 in the DES and E2 complexes are between one and two turns longer
than they are in the OHT complex (Figure 6A and Brzozowski, et al., supra). Helix 11 ends at
Cys 530 in the DES and E2 complexes and it ends at Tyr 526 in the OHT complex. Helix 12
begins at Leu 536 in the OHT complex. This appears to be necessary; in the antagonist complex,
Leu 536 forms a cooperative network of nonpolar contacts and hydrogen bonds with Glu 380
and Tyr 537 that stabilizes the N-terminus of helix 12 (Figure 1B). Therefore, if helix 12 were to
bind the static region of the coactivator binding site in the presence of agonist, the loop
connecting helices 11 and 12 would be required to span ~17 Å over five residues. Although
theoretically possible, this conformation would be highly strained and hence unlikely. In
contrast, the longer loop connecting helices 11 and 12 in the OHT complex allows helix 12 to
extend to the static region of the coactivator binding groove.
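
The statement that spanning ~17 Å over five residues would be highly strained can be illustrated with a rough per-residue estimate. The reference value used below, about 3.8 Å between consecutive α-carbons of a polypeptide chain, is a general textbook figure assumed here as an upper bound; the function names are hypothetical, and the calculation ignores the exact positions of the loop end points.

```python
# Approximate distance between consecutive alpha-carbons (textbook value, assumed).
MAX_CA_CA_DISTANCE_A = 3.8


def span_per_residue(distance_a, n_residues):
    """Average distance in angstroms that each residue must cover along the span."""
    if n_residues <= 0:
        raise ValueError("n_residues must be positive")
    return distance_a / n_residues


def extension_fraction(distance_a, n_residues):
    """Required per-residue span as a fraction of the assumed upper bound."""
    return span_per_residue(distance_a, n_residues) / MAX_CA_CA_DISTANCE_A


if __name__ == "__main__":
    # The ~17 angstrom, five-residue span discussed above.
    print(span_per_residue(17.0, 5), round(extension_fraction(17.0, 5), 2))
```

Each residue would need to cover about 3.4 Å, close to what an essentially fully extended chain can reach, which is consistent with the characterization of this conformation as strained.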

In the DES and E2 complexes, helix 12 and the loop connecting helices 11 and 12 pack
against helices 3 and 11, whereas they do not in the OHT complex (Figures 2A and 2B and
Brzozowski, et al., supra). A recently described structure of the E2-LBD complex suggests that
the longer helices in the DES and E2 complexes are not dependent upon the interactions helix 12
forms in the agonist-bound conformation (Tanenbaum, et al., supra). In this structure, a crystal
packing artifact forces helix 12 to contact a symmetry-related molecule. Helix 12 is clearly not
positioned over the ligand binding pocket in this structure. Nevertheless, helices 3, 8 and 11 are
longer than they are in the OHT complex (Figure 6A). Hence the longer helices of the agonist
complexes occur independently of the positioning of helix 12 over the ligand binding pocket and
are instead a direct result of agonist binding.

The secondary structure differences between the agonist complexes and the OHT
complex arise from distinct arrangements of packing interactions induced by the different
ligands. A cooperative network of van der Waals contacts, organized around DES or E2,
between various hydrophobic residues from helices 3, 7, 8 and 11 and the β hairpin appears to
stabilize the longer helices in the agonist complexes (Figures 4A and 6D). The placement of the
OHT B ring forces many of the ligand binding pocket residues that surround it to adopt
conformations that are dramatically different from those they adopt in either the DES or E2
structures. As a result, many of the interresidue packing interactions present in the DES and E2
structures are either absent or altered in the OHT structure (Figure 6D). These structural
distortions apparently force the main chain from residues 339 to 341, 421 to 423, and 527 to 530
(which form parts of helices 3, 8 and 11, respectively, in the agonist structures) to adopt an
extended conformation in the OHT structure (Figures 6A-D).

Therefore, the binding of OHT has two distinct effects on the positioning of helix 12, each
of which contributes to antagonism (Figure 7). Helix 12 is prevented from being positioned over
the ligand binding pocket by the OHT side chain. In addition, the alternative packing
arrangement of ligand binding pocket residues around OHT stabilizes a conformation of the
LBD that permits helix 12 to reach the static region of the coactivator binding site and mimic
bound coactivator.

These mechanisms do not appear to be specific to OHT. The side chain of RAL, like that
of OHT, sterically hinders the agonist-bound conformation of helix 12 (Brzozowski, et al.,
supra). In addition, helix 11 appears to end at Met 528 in the RAL complex. This may result
from the distortions in the binding pocket in the vicinity of His 524 directed by RAL binding
(Brzozowski, et al., supra).

These results are supported by the experimental data provided in the examples below, 
which are also intended to illustrate various aspects of this invention. These examples do not 
limit the scope of this invention. 

EXAMPLES 

Example 1

Experimental Procedures

Protein Expression and Purification

The human ERα-LBD 297-554 was overexpressed as described previously (Seielstad, et
al., Mol. Endocrinol. 9:647-658 (1995)) in BL21(DE3)pLysS cells transformed with a modified
pET-23d-ERG vector that contained the sequence Met-Asp-Pro fused to residues 297 through
554 of the hERα (provided by Paul Sigler of Yale University). Clarified bacterial lysates were
adjusted to 3 M in urea and 0.7 M in NaCl and then applied to a 10-ml column of
estradiol-Sepharose (Greene, et al., Proc. Natl. Acad. Sci. USA 77:5115-5119 (1980); Landel, et
al., Mol. Endocrinol. 8:1407-1419 (1994); Landel, et al., J. Steroid Biochem. Molec. Biol.
63:59-73 (1997)).

To carboxymethylate the solvent-accessible cysteines, the bound hERα-LBD was treated
with 5 mM iodoacetic acid in 10 mM Tris, pH 8.1, 250 mM NaSCN (Hegy, et al., Steroids
61:367-373 (1996)). Protein was eluted with 3 × 10⁻⁵ M ligand (either DES or OHT) in 30-100
ml of 50 mM Tris, 1 mM EDTA, 1 mM DTT and 250 mM NaSCN, pH 8.5. The yield of
hERα-LBD was typically close to 100% (Seielstad, et al., Biochemistry 34:12605-12615 (1995)).
The affinity-purified material was concentrated and exchanged into 20 mM Tris, 1 mM EDTA,
4 mM DTT, pH 8.1 by ultrafiltration. The protein was bound to a Resource Q column
(Pharmacia) and then eluted with a linear gradient of 25-350 mM NaCl in 20 mM Tris, pH 8.1,
1 mM DTT. The hERα-LBD-ligand complexes eluted at 150-200 mM NaCl. Pooled fractions
were concentrated by ultrafiltration and analyzed by SDS-PAGE, native PAGE, and
electrospray ionization mass spectrometry.
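
As a rough illustration of where the complexes elute along the ion-exchange gradient described above, the sketch below converts a salt concentration into a fractional position along a linear gradient. The function name is hypothetical, and the calculation assumes an ideal linear gradient with no delay volume.

```python
def gradient_fraction(conc_mM: float, start_mM: float = 25.0,
                      end_mM: float = 350.0) -> float:
    """Fractional position along an ideal linear salt gradient."""
    if not start_mM <= conc_mM <= end_mM:
        raise ValueError("concentration lies outside the gradient")
    return (conc_mM - start_mM) / (end_mM - start_mM)


# Elution window reported for the hERα-LBD-ligand complexes (150-200 mM NaCl)
elution_start = gradient_fraction(150.0)
elution_end = gradient_fraction(200.0)
print(f"elution between {elution_start:.0%} and {elution_end:.0%} of the gradient")
```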

GST-pulldown Assays

A fusion between glutathione-S-transferase (GST) and amino acids 282-595 of hERα
was constructed by subcloning the EcoRI fragment from pSG5 ERα-LBD (Lopez et al.,
submitted manuscript) into pGEX-3X (Pharmacia). The Ile 358->Arg, Lys 362->Ala, and Leu
539->Arg mutations were introduced into the GST-LBD construct using the QuikChange Kit
(Stratagene) according to the manufacturer's instructions. The Val 376->Arg and Glu 542->Lys
mutations were created in the GST-LBD construct by subcloning the BsmI/HindIII fragments of
derivatives of pSG5-ER-HEGO (Tora, et al., EMBO J. 8:1981-6 (1989)) into which these
mutations had already been introduced. All constructs were verified by automated sequencing
(University of Chicago Cancer Research Center DNA Sequencing Facility).

The wild-type and mutant GST-LBDs were expressed in BL21(DE3) cells. Total ligand
binding activity was determined by a controlled pore glass bead assay (Greene, et al., Mol.
Endocrinol. 2:714-726 (1988)) and protein levels were monitored by western blotting with a
monoclonal antibody to hERα (H222). Cleared extracts containing the GST-LBDs were
incubated in buffer alone (50 mM Tris, pH 7.4, 150 mM NaCl, 2 mM EDTA, 1 mM DTT, 0.5%
NP-40 and a protease inhibitor cocktail) or with 1 µM of either DES or OHT for 1 hour at 4°C.
Extract samples containing 30 pmol of GST-LBD were then incubated with 10 µl
glutathione-Sepharose-4B beads (Pharmacia) for 1 hour at 4°C. Beads were washed five times
with 20 mM HEPES, pH 7.4, 400 mM NaCl, and 0.05% NP-40. 35S-labeled GRIP1 was
synthesized by in vitro transcription and translation using the TNT Coupled Reticulocyte Lysate
System (Promega) according to the manufacturer's instructions and pSG5-GRIP1 (provided by
Michael Stallcup of the University of Southern California) as the template. Immobilized
GST-LBDs were incubated for 2.5 hours with 2.5 µl aliquots of crude translation reaction mixture
diluted in 300 µl of Tris-buffered saline (TBS). After five washes in TBS containing 0.05%
NP-40, proteins were eluted by boiling the beads for 10 minutes in sample buffer. Bound
35S-GRIP1 was quantitated by fluorography following SDS-PAGE.

Crystallization and Data Collection

Crystals of the DES-hERα LBD-GRIP1 NR Box II peptide complex were obtained by
hanging drop vapor diffusion. Prior to crystallization, the DES-hERα LBD (residues 297-554)
complex was incubated with a 2-4 fold molar excess of the GRIP1 NR Box II peptide (SEQ ID
NO:4) for 7-16 hr. Two µL samples of this solution were mixed with equal volume samples of
reservoir buffer consisting of 25-27% (w/v) PEG 4000, 90 mM Tris (pH 8.75-9.0) and 180 mM
Na Acetate and suspended over wells containing 800 µL of the reservoir buffer. After 4-7 days
at 19-21°C, rod-like crystals were obtained. The coactivator complex crystals lie in the
space group P2₁ with cell dimensions a=54.09 Å, b=82.22 Å, c=58.04 Å and β=111.34°. Two
molecules each of the DES-LBD and the coactivator peptide form the asymmetric unit. A
200 µm x 40 µm x 40 µm crystal was transferred to a cryosolvent solution containing 25% (w/v)
PEG 4000, 10% (w/v) ethylene glycol, 100 mM Tris (pH 8.5), 200 mM Na Acetate and 10 µM
peptide and frozen in an N2 stream at -170°C in a rayon loop. Diffraction data from this crystal
were measured at -170°C using a 300 mm MAR image plate at the Stanford Synchrotron
Radiation Laboratory (SSRL) at beamline 7-1 at a wavelength of 1.08 Å.

Crystals of the hERα LBD complexed to OHT were obtained by the hanging drop vapor
diffusion method. Equal volume aliquots (2 µL) of a solution containing 3.9 mg/mL
protein-ligand complex and the reservoir solution containing 9% (w/v) PEG 8000, 6% (w/v)
ethylene glycol, 50 mM HEPES (pH 6.7) and 200 mM NaCl were mixed and suspended over
800 µL of the reservoir solution. Hexagonal plate-like crystals formed after 4-7 days at 21-23°C.
Both crystal size and quality were improved through microseeding techniques. These crystals
belong to the space group P6₅22 with cell parameters a=b=58.24 Å and c=277.47 Å. The
asymmetric unit consists of a single LBD monomer; the dimer axis lies along a crystallographic
two-fold. A single crystal (400 µm x 250 µm x 40 µm) was briefly incubated in a cryoprotectant
solution consisting of 10% (w/v) PEG 8000, 25% (w/v) ethylene glycol, 50 mM HEPES (pH 7.0)
and 200 mM NaCl and then flash frozen in liquid N2 suspended in a rayon loop. Diffraction data
were measured at -170°C using a 345 mm MAR image plate at SSRL at beamline 9-1 and at a
wavelength of 0.98 Å.

The images of both data sets were processed with DENZO and scaled with
SCALEPACK (Otwinowski, et al., Methods Enzymol. 276:307-326 (1997)) using the default
-3σ cutoff.
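
The cell parameters reported above can be related to the contents of each asymmetric unit with a short calculation. The sketch below is illustrative only: the function names are hypothetical, and the numbers of symmetry operators (2 for the monoclinic DES complex cell, 12 for the hexagonal OHT complex cell) are standard crystallographic values assumed here rather than stated in this specification.

```python
import math


def monoclinic_volume(a: float, b: float, c: float, beta_deg: float) -> float:
    """Unit cell volume (Å^3) of a monoclinic cell: V = a*b*c*sin(beta)."""
    return a * b * c * math.sin(math.radians(beta_deg))


def hexagonal_volume(a: float, c: float) -> float:
    """Unit cell volume (Å^3) of a hexagonal cell: V = a^2*c*sin(60°)."""
    return a * a * c * math.sin(math.radians(60.0))


# DES-LBD-peptide complex cell (monoclinic; 2 symmetry operators assumed)
v_des = monoclinic_volume(54.09, 82.22, 58.04, 111.34)
# OHT-LBD complex cell (hexagonal; 12 symmetry operators assumed)
v_oht = hexagonal_volume(58.24, 277.47)

# Volume available per LBD: two LBDs per asymmetric unit in the DES
# complex crystal, one LBD per asymmetric unit in the OHT complex crystal.
per_lbd_des = v_des / 2 / 2
per_lbd_oht = v_oht / 12 / 1
print(f"DES cell {v_des:.0f} Å^3, {per_lbd_des:.0f} Å^3 per LBD")
print(f"OHT cell {v_oht:.0f} Å^3, {per_lbd_oht:.0f} Å^3 per LBD")
```

Under these assumptions the two crystal forms provide a broadly similar volume per LBD, as would be expected for crystals of the same protein domain.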

Structure Determination and Refinement

Initial efforts to determine the structure of the DES-LBD-GRIP1 NR Box II peptide
complex utilized a low resolution (3.1 Å) data set (data not shown). A self-rotation search
implemented with POLARRFN ("The CCP4 suite: programs for protein crystallography", Acta
Crystallogr. D 50:760-763 (1994)) indicated the presence of a noncrystallographic dyad. The
two LBDs in the asymmetric unit were located by molecular replacement in AMoRe (CCP4, 1994)
using a partial polyalanine model of the human RARγ LBD (Renaud, et al., supra) as the search
probe (R=58.2%, CC=35.6% after placement of both monomers). Given that the model at this
point was both inaccurate (r.m.s.d. 1.7 Å between this model and the final model based on Cα
positions) and incomplete (accounting for only ~45% of the total scattering matter in the
asymmetric unit), an aggressive density modification protocol was undertaken. Iterative cycles
of two-fold NCS averaging in DM (CCP4, 1994) interspersed with model building in MOLOC
(Muller, et al., Bull. Soc. Chim. Belg. 97:655-667 (1988)) and model refinement in REFMAC
(Murshudov, et al., Acta Crystallogr. D 53:240-255 (1997)) (using tight NCS restraints) were
used to quickly build a model of the LBD alone. For this procedure, MAMA (Kleywegt, et al.,
"Halloween ... masks and bones", in From First Map to Final Model, Bailey, et al., eds.,
Warrington, England, SERC Daresbury Laboratory, 1994) was used for all mask manipulations
and PHASES (Furey, et al., PA33 Am. Cryst. Assoc. Mtg. Abstr. 18:73 (1990)) and the CCP4
suite (CCP4, 1994) were used for the generation of structure factors and the calculation of
weights.

However, although the DES-LBD-GRIP1 NR Box II peptide complex model accounted
for ~90% of the scattering matter in the asymmetric unit, refinement was being hampered by
severe model bias. The OHT complex data set was then collected. Starting with one of the
monomers of the preliminary DES-LBD model as the search probe, molecular replacement in
AMoRe was used to search for the location of the LBD in this crystal form in both P6₁22 and
P6₅22. A translation search in P6₅22 yielded the correct solution (R=53.8%, CC=38.2%). In
order to reduce model bias, DMMULTI (CCP4, 1994) was then used to project averaged density
from the DES complex cell into the OHT complex cell. Using MOLOC, a model of the LBD was
built into the resulting density. The model was refined initially in REFMAC and later with the
simulated annealing, positional and B-factor refinement protocols in X-PLOR (Brunger,
X-PLOR Version 3.843, New Haven, Connecticut: Yale University, 1996) using a
maximum-likelihood target (Adams, et al., Proc. Natl. Acad. Sci. USA 94:5018-23 (1997)).
Anisotropic scaling and a bulk solvent correction were used and all B-factors were refined
isotropically. Except for the Rfree set (a random sampling consisting of 8% of the data set), all
data between 41 and 1.9 Å (with no σ cutoff) were included. The final model consisted of
residues 306-551, the ligand and 79 waters. According to PROCHECK (CCP4, 1994), 91.6% of
all residues in the model were in the core regions of the Ramachandran plot and none were in
the disallowed regions.
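
The random Rfree sampling described above can be sketched as follows. This is a generic illustration of setting aside a random fraction of reflections for cross-validation; it is not the procedure implemented in X-PLOR or REFMAC, the function name and fixed seed are hypothetical, and the reflection count is taken from the unique reflections listed for the OHT data set in Table 2.

```python
import random


def split_free_set(reflection_ids, free_fraction, seed=0):
    """Randomly assign a fraction of reflections to the free (test) set.

    Returns (working_ids, free_ids); the two lists do not overlap.
    """
    ids = list(reflection_ids)
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    n_free = round(len(ids) * free_fraction)
    free = set(rng.sample(ids, n_free))
    working = [i for i in ids if i not in free]
    return working, sorted(free)


# 8% free set drawn from 23064 unique reflections (OHT data set)
working, free = split_free_set(range(23064), 0.08)
print(len(working), len(free))
```

The free reflections are excluded from refinement and used only to compute Rfree, so that this statistic is not biased by fitting of the model to the working set.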

The high resolution data set of the DES-LBD-GRIP1 NR Box II peptide complex became
available when the Rfree of the OHT-LBD model was ~31%. Both monomers in the asymmetric
unit of the DES complex crystal were relocated using AMoRe and the incompletely refined
OHT-LBD model (with helix 12 and the loop between helices 11 and 12 removed) as the search
model. The missing parts of the model were built and the rest of the model was corrected using
MOLOC and two-fold averaged maps generated in DM. Initially, refinement was carried out
with REFMAC using tight NCS restraints. At later stages, the model was refined without NCS
restraints using the simulated annealing, positional and B-factor refinement protocols in
X-PLOR and a maximum-likelihood target. All B-factors were refined isotropically and
anisotropic scaling and a bulk solvent correction were used. The Rfree set contained a random
sample of 6.5% of all data. In refinement, all data between 27 and 2.03 Å (with no σ cutoff) were
used. The final model was composed of residues 305-549 of monomer A, residues 305-461 and
470-554 of monomer B, residues 687-697 of peptide A, residues 686-696 of peptide B, two
ligand molecules, 147 waters, two carboxymethyl groups and a chloride ion. According to
PROCHECK, 93.7% of all residues in the model were in the core regions of the Ramachandran
plot and none were in the disallowed regions.

Figure 1A provides a view of a 2Fo-Fc electron density map calculated at 2.03 Å
resolution and contoured at 1.0 σ showing the GRIP1 NR box II interaction with the LBD. The
GRIP1 NR Box II peptide (SEQ ID NO:4) was omitted from the model prior to map calculation.
Ile 689 from the peptide and two of the three receptor residues with which it interacts (Glu 542
and Leu 539) are labeled. Asp 538 has been omitted for clarity. The hydrogen bonds between
the γ-carboxylate of Glu 542 and the amides of residues 689 and 690 of the peptide are depicted
as dashed orange bonds. Figure 1B provides a view of a 2Fo-Fc electron density map calculated
at 1.90 Å resolution and contoured at 1.0 σ showing the N-terminal region of helix 12. The
dashed orange bonds depict the water-mediated hydrogen bond network between the imidazole
ring of His 377, the γ-carboxylate of Glu 380, and the amide of Tyr 537. The three labeled
residues (Glu 380, Leu 536 and Tyr 537) interact with each other through van der Waals contacts
and/or hydrogen bonds. Intriguingly, mutations in each of these three residues dramatically
increase the transcriptional activity of unliganded ERα LBD (Eng, et al., Mol. Cell. Biol.
17:4644-4653 (1997); Lazennec, et al., Mol. Endocrinol. 11:1375-86 (1997); White, et al.,
EMBO J. 16:1427-35 (1997)).

Example 2
Structure Determination

GRIP1, a mouse p160 coactivator, interacts both in vivo and in vitro with the ERα LBD
bound to agonist (Ding, et al., supra), but not with the LBD bound to antagonist (Norris, et al., J.
Biol. Chem. 273:6679-88 (1998)). Mutational studies of GRIP1 and its human homologue TIF2
suggest that of the three NR boxes from GRIP1, NR box II (residues 690 to 694) binds most
tightly to the ERα LBD (Ding, et al., supra and Voegel, et al., supra).

Competition assays indicate that a 13 residue GRIP1 NR Box II peptide,
NH2-KHKILHRLLQDSS-CO2H (SEQ ID NO:4), corresponding to residues 686-698 of GRIP1
(Ding, et al., supra), synthesized by standard solid phase methods, binds specifically to the
agonist-bound ERα LBD (IC50 < 0.4 µM; Kushner, unpublished) and to other agonist-bound NR
LBDs (Ding, et al., supra and Darimont, et al., supra).
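
The relationship between the peptide sequence and the GRIP1 residue numbering used throughout these examples can be illustrated with a short sketch. The helper names are hypothetical; the sequence and the numbering of its first residue (686) are taken from the paragraph above.

```python
import re

PEPTIDE = "KHKILHRLLQDSS"   # GRIP1 NR Box II peptide (SEQ ID NO:4)
FIRST_RESIDUE = 686         # GRIP1 numbering of the first peptide residue


def number_residues(sequence: str, first: int) -> dict:
    """Map GRIP1 residue numbers to one-letter amino acid codes."""
    return {first + i: aa for i, aa in enumerate(sequence)}


def find_lxxll(sequence: str, first: int) -> list:
    """Residue numbers at which an LXXLL motif begins (overlaps allowed)."""
    return [first + m.start() for m in re.finditer(r"(?=L..LL)", sequence)]


numbered = number_residues(PEPTIDE, FIRST_RESIDUE)
motif_starts = find_lxxll(PEPTIDE, FIRST_RESIDUE)
print(motif_starts, numbered[689], numbered[690], numbered[693], numbered[694])
```

The single motif found spans residues 690 to 694, in agreement with the residue range given for NR box II, and the residue immediately preceding it is Ile 689.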

An electrophoretic mobility shift assay was used to demonstrate that the GRIP1 NR Box
II peptide (SEQ ID NO:4) bound the ERα LBD in the presence of the agonist, DES, but not the
antagonist, OHT. Eight microgram samples of purified hERα-LBD bound to either DES or
OHT were incubated in the absence of the GRIP1 NR Box II peptide (SEQ ID NO:4), i.e., buffer
alone, or in the presence of either a 2-fold or 10-fold molar excess of the GRIP1 NR Box II
peptide. The binding reactions were performed on ice for 45 minutes in 10 µl of buffer
containing 20 mM Tris, pH 8.1, 1 mM DTT, and 200 mM NaCl and then subjected to 6% native
PAGE. Gels were stained with GELCODE Blue Stain reagent (Pierce).

In the presence of the NR box II peptide, the migration of the DES-LBD complex was
retarded. In contrast, peptide addition had no effect on the mobility of the OHT-LBD complex.
Hence, this peptide fragment of GRIP1 possesses the ligand-dependent receptor binding activity
characteristic of the full-length protein. These observations suggest that the GRIP1 NR Box II
peptide is a valid model for studying the interaction between GRIP1 and the ERα LBD.

In order to characterize structurally the interaction between the GRIP1 NR Box II peptide
and the ERα LBD, recombinant human ERα LBD (residues 297-554) was crystallized bound to
both DES and the GRIP1 NR Box II peptide. The ERα LBD bound to OHT was also
crystallized in order to determine the mechanism by which this antagonist blocks
coactivator/ERα interaction. X-ray diffraction data from these crystals were measured and the
structures were determined by a combination of molecular replacement (using a modified
version of the coordinates of the human retinoic acid receptor γ (RARγ) LBD, Renaud, et al.,
supra, as the search model) and aggressive density modification.

The structure of the DES-ERα LBD-GRIP1 NR Box II peptide complex has been refined
to a crystallographic R-factor of 19.9% (Rfree=25.0%) using data to 2.03 Å resolution, as shown
in Figure 1A and Table 2. The structure of the OHT-ERα LBD complex has been refined using
data to 1.90 Å to a crystallographic R-factor of 23.0% (Rfree=26.2%), as shown in Figure 1B and
Table 2.

Table 2 

Summary of Crystallographic Statistics 

Ligand                             DES            OHT

Data Collection
  Space group                      P2₁            P6₅22
  Resolution (Å)                   2.03           1.90
  Observations                     104189         269253
  Unique                           30265          23064
  Completeness (%)                 98.4           99.1
  Rsym (%) (a)                     7.8            7.0
  Average I/σI                     9.8            16.1

Refinement
  Number of non-hydrogen atoms     4180           2070
  Rcryst (%) (b) / Rfree (%)       19.9/25.0      23.0/26.1
  Bond r.m.s. deviation (Å)        0.006          0.006
  Angle r.m.s. deviation (°)       1.05           1.05
  Average B factor (Å²)            34.0           40.4

(a) Rsym = Σi |Ii - <Ii>| / Σi Ii, where <Ii> is the average
    intensity over symmetry equivalents.
(b) Rcryst = Σ |Fo - Fc| / Σ |Fo|.
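
The two agreement statistics defined in the footnotes to Table 2 can be written out as a short sketch. The function names and the toy input values are hypothetical and serve only to show how each sum is formed; the final lines compute the average redundancy of each data set from the observation counts listed in the table.

```python
def r_sym(groups):
    """Rsym = sum |I - <I>| / sum I over groups of symmetry equivalents.

    `groups` is a list of lists; each inner list holds the measured
    intensities of one set of symmetry-equivalent reflections.
    """
    numerator = 0.0
    denominator = 0.0
    for intensities in groups:
        mean = sum(intensities) / len(intensities)
        numerator += sum(abs(i - mean) for i in intensities)
        denominator += sum(intensities)
    return numerator / denominator


def r_cryst(f_obs, f_calc):
    """Rcryst = sum |Fo - Fc| / sum |Fo| over structure factor amplitudes."""
    numerator = sum(abs(fo - fc) for fo, fc in zip(f_obs, f_calc))
    return numerator / sum(abs(fo) for fo in f_obs)


# Toy values, for illustration only
toy_rsym = r_sym([[100.0, 110.0, 90.0], [50.0, 50.0]])
toy_rcryst = r_cryst([10.0, 20.0, 30.0], [9.0, 22.0, 30.0])

# Average redundancy (observations per unique reflection) from Table 2
redundancy_des = 104189 / 30265
redundancy_oht = 269253 / 23064
print(toy_rsym, toy_rcryst, redundancy_des, redundancy_oht)
```

Rfree is computed with the same expression as Rcryst, but only over the reflections that were set aside from refinement.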

Example 3

Overall Structure of the DES-LBD-GRIP1 NR Box II Peptide Complex
The asymmetric unit of the DES-LBD-GRIP1 NR Box II peptide complex crystals
contains the same noncrystallographic dimer of LBDs that has been observed in the previously
determined structures of the LBD bound to both E2 and RAL (Brzozowski, et al., supra and
Tanenbaum, et al., supra). Beyond the flexible loops between helices 2 and 3 and helices 9 and
10, the two LBDs of the dimer adopt similar structures (r.m.s.d. 0.47 Å based on Cα positions).
The conformation of each LBD complexed with DES closely resembles that of the LBD bound
to E2 (Brzozowski, et al., supra); each monomer is a wedge shaped molecule consisting of three
layers of eleven to twelve helices and a single beta hairpin (Figure 2A). In each LBD, the
hydrophobic face of helix 12 is packed against helices 3, 5/6 and 11 covering the ligand binding
pocket (Figure 2A). One GRIP1 NR Box II peptide is bound to each LBD in a hydrophobic cleft
composed of residues from helices 3, 4, 5 and 12 and the turn between 3 and 4 (Figures 2A and
3A). The density for both peptides in the asymmetric unit is continuous and unambiguous
(Figure 1A). Residues 687 to 697 from the GRIP1 NR Box II peptide, designated peptide A, and
residues 686 to 696 from the GRIP1 NR Box II peptide, designated peptide B, have been
modeled; the remaining residues are disordered. Given that each peptide lies within a different
environment within the crystal, it is striking that from residues Ile 689 to Gln 695 each peptide
forms a two turn, amphipathic α helix (Figures 2A and 3A). Flanking this region of common
secondary structure, the peptides adopt dissimilar random coil conformations.
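
The r.m.s.d. value quoted above for the two LBDs of the dimer is a root-mean-square deviation over matched Cα positions. A minimal sketch of that calculation follows. It assumes the two coordinate sets have already been superimposed and are listed in matching residue order; the superposition step itself is not shown, and the function name and toy coordinates are hypothetical.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation (Å) between matched coordinate sets.

    Each argument is a sequence of (x, y, z) tuples, one per C-alpha atom,
    given in the same residue order and already superimposed.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate sets must be non-empty and equal in length")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))


# Toy example: two atoms, each displaced by 0.47 Å along z
monomer_a = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0)]
monomer_b = [(0.0, 0.0, 0.47), (3.8, 0.0, 0.47)]
toy_rmsd = rmsd(monomer_a, monomer_b)
print(f"{toy_rmsd:.2f} Å")
```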

The overall structures of the DES-ERα LBD-GRIP1 NR Box II peptide complex and the
OHT-ERα LBD complex are illustrated in Figure 2. In Figure 2A, the coactivator peptide and
the LBD are shown as ribbon drawings. The peptide is colored gold and helix 12 (residues
538-546) is colored magenta. Helices 3, 4 and 5 (labeled H3, H4 and H5, respectively) are colored
blue. DES, colored green, is shown in space-filling representation. In Figure 2B, the LBD is
depicted as a ribbon drawing. As in Figure 2A, helix 12 (residues 536-544) is colored
magenta and helices 3, 4 and 5 are colored blue. OHT, in red, is shown in space-filling
representation.

Example 4

The NR Box II Peptide-LBD Interface

The binding of the GRIP1 NR Box II peptide to the ERα LBD buries 1000 Å² of
predominantly hydrophobic surface area from both molecules. The GRIP1 NR Box II peptide
binding site is a shallow groove composed of residues Leu 354, Val 355, Ile 358, Ala 361 and
Lys 362 from helix 3; Phe 367 and Val 368 from helix 4; Leu 372 from the turn between helices
3 and 4; Gln 375, Val 376, Leu 379 and Glu 380 from helix 5; and Asp 538, Leu 539, Glu 542
and Met 543 from helix 12 (Figure 3A). The floor and sides of this groove are completely
nonpolar, but the ends of this groove are charged (Figure 3C).

The LBD interacts primarily with the hydrophobic face of the GRIP1 NR Box II peptide
α helix formed by the side chains of Ile 689 and the three LXXLL motif (SEQ ID NO:1)
leucines (Leu 690, Leu 693 and Leu 694). The side chain of Leu 690 is deeply embedded within
the groove and forms van der Waals contacts with the side chains of Ile 358, Val 376, Leu 379,
Glu 380 and Met 543 (Figures 3A and 3C). The side chain of Leu 694 is similarly isolated within
the groove and makes van der Waals contacts with the side chains of Ile 358, Lys 362, Leu 372,
Gln 375, Val 376 and Leu 379 (Figures 3A and 3C). In contrast, the side chains of both Ile 689
and the second NR box leucine, Leu 693, rest against the rim of the groove (Figures 3A and 3C).
The side chain of Ile 689 lies in a shallow depression formed by the side chains of Asp 538, Leu
539 and Glu 542. The side chain of Leu 693 makes nonpolar contacts with the side chains of Ile
358 and Leu 539.

The charged and polar side chains which form the hydrophilic face of the peptide helix
project away from the receptor and either interact predominantly with solvent or form symmetry
contacts (Figure 1A). None of the side chains of the polar and charged residues outside the
helical region of either peptide in the asymmetric unit, with the exception of Lys 688 of peptide
B, is involved in hydrogen bonds or salt bridges with its associated LBD monomer. The ε-amino
group of Lys 688 of peptide B hydrogen bonds to the side chain carboxylate of Glu 380 of
monomer B. This interaction is presumably a crystal artifact; the main chain atoms of the
N-terminal three residues of peptide B are displaced from monomer B and interact extensively
with a symmetry-related LBD.

In addition to interacting with the hydrophobic face of the peptide helix, the LBD
stabilizes the main chain conformation of the NR box peptide by forming capping interactions
with both ends of the peptide helix. Glu 542 and Lys 362 are positioned at opposite ends of the
peptide binding site (Figure 3A). The side chains of Glu 542 and Lys 362 form van der Waals
contacts with main chain and side chain atoms at the N- and C-terminal turns of the peptide helix,
respectively. These interactions position the stabilizing charges of the γ-carboxylate of Glu 542
and ε-amino group of Lys 362 near the ends of the GRIP1 NR Box II peptide helix (Figure 3C).
The side chain carboxylate (γ-carboxylate) of Glu 542 hydrogen bonds to the amides of the
residues of the N-terminal turn of the peptide helix (residues 688 and 689 of peptide A; residues
689 and 690 of peptide B) (Figure 1A). Similarly, the ε-amino group of Lys 362 hydrogen bonds
to the carbonyls of the residues of the C-terminal turn of the peptide helix (residue 693 of peptide
A; residues 693 and 694 of peptide B) (Figure 5).

To test the importance of the GRIP1 NR Box II peptide/LBD interface observed in the
crystal, a series of site-directed mutations were introduced into the ERα LBD. These mutations
were designed either to simultaneously perturb the structural integrity and the nonpolar character
of the floor of the binding groove (Ile 358->Arg, Val 376->Arg and Leu 539->Arg) or to prevent
the formation of the capping interactions (Lys 362->Ala and Glu 542->Lys). Fusions of
glutathione-S-transferase (GST) to the wild-type and mutant LBDs were analyzed for their
ability to bind 35S-labeled GRIP1 in the absence of ligand or in the presence of DES or OHT.

35S-labeled GRIP1 was incubated with either immobilized GST, immobilized wild type
GST-hERα LBD, or immobilized mutant GST-LBDs in the absence of ligand or in the presence
of DES or OHT. The bound GRIP1 was quantitated after SDS-PAGE. I358R, mutant LBD
containing an Ile->Arg substitution at residue 358; K362A, mutant LBD containing a Lys->Ala
substitution at residue 362; V376R, mutant LBD containing a Val->Arg substitution at residue
376; L539R, mutant LBD containing a Leu->Arg substitution at residue 539; E542K, mutant
LBD containing a Glu->Lys substitution at residue 542.
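
The correspondence between the long-form mutation names used in the text and the one-letter shorthand used in the legend above can be sketched as follows. The helper name is hypothetical and the lookup table covers only the amino acids that occur in these five mutations.

```python
import re

# Three-letter to one-letter codes for the residues involved here only
THREE_TO_ONE = {"Ile": "I", "Lys": "K", "Val": "V", "Leu": "L",
                "Glu": "E", "Arg": "R", "Ala": "A"}


def to_shorthand(mutation: str) -> str:
    """Convert e.g. 'Ile 358->Arg' to 'I358R'."""
    match = re.fullmatch(r"\s*([A-Za-z]{3})\s*(\d+)\s*->\s*([A-Za-z]{3})\s*",
                         mutation)
    if match is None:
        raise ValueError(f"unrecognized mutation format: {mutation!r}")
    wild_type, position, mutant = match.groups()
    return f"{THREE_TO_ONE[wild_type]}{position}{THREE_TO_ONE[mutant]}"


mutations = ["Ile 358->Arg", "Lys 362->Ala", "Val 376->Arg",
             "Leu 539->Arg", "Glu 542->Lys"]
shorthand = [to_shorthand(m) for m in mutations]
print(shorthand)
```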

In the absence of ligand or in the presence of OHT, fusions to the wild-type protein and
all of the mutant LBDs showed no detectable binding to GRIP1. The Ile 358->Arg, Val
376->Arg and Leu 539->Arg mutants were all unable to interact with coactivator in the presence
of agonist, confirming the importance of the packing interactions observed in the crystal.
Disruption of either the N- or C-terminal capping interaction also compromised GRIP1 binding
in the presence of agonist. Only the wild-type GST-LBD was able to recognize the coactivator
in the presence of DES.

Example 5
Agonist Recognition

Despite its different shape and larger molecular volume, DES (273 Å³) is accommodated
within the same binding pocket that recognizes E2 (252 Å³). In its receptor complex, DES is
completely encased within the narrower half of the LBD in a predominantly hydrophobic cavity
composed of residues from helices 3, 6, 7, 8, 11, and 12 as well as the S1/S2 hairpin (Figures 2A
and 4A).

The interaction of DES with ERα resembles that of E2. One of the phenolic rings of DES
lies in the same position as the E2 A ring near helices 3 and 6. Like the aromatic ring of E2,
the DES A ring (Figure 4A) is engaged by the side chains of Phe 404, Ala 350, Leu 387 and Leu
391 with its phenolic hydroxyl forming hydrogen bonds to the γ-carboxylate of Glu 353, to the
guanidinium group of Arg 394, and to a structurally conserved water molecule. The A' ring of
DES (Figure 4A) is bound near helices 7, 8 and 11 adjacent to the location of the E2 C and D
rings. This ring forms van der Waals contacts not only with Gly 521 and Leu 525, like the D
ring of E2, but also with Met 343, Leu 346 and Met 421 (Figure 4A). Even though it is located
1.7 Å from the position of the D ring hydroxyl, the DES A' ring phenolic hydroxyl is still able to
hydrogen bond to the imidazole ring of His 524 (Figure 4A).

DES also forms contacts with the LBD that E2 does not. There are unoccupied cavities
adjacent to the α face of the B ring and the β face of the C ring of E2 (Brzozowski, et al.,
supra and Tanenbaum, et al., supra). The ethyl groups of DES, which project perpendicularly
from the plane of the phenolic rings, fit snugly into these spaces. The resulting additional
nonpolar contacts with the side chains of Ala 350, Leu 384, Phe 404, and Leu 428 (Figure 4A)
may account for the higher affinity of DES for the receptor (Kuiper, et al., Endocrinology
138:863-70 (1997)).

Except for Met 421 and Met 528 (both of which contact only DES) and Met 388 and Ile
424 (both of which contact only E2), the ER is able to use the same residues to form all of the
observed hydrogen bonds and van der Waals contacts with both agonists (Figure 4A,
Brzozowski, et al., supra, and Tanenbaum, et al., supra). This remarkable adaptability is
presumably the result of both the relatively large molecular volume of the binding pocket
(~500 Å³ in both complexes) and its apparent structural plasticity. In particular, at the DES A'
ring/steroid D ring end of the binding pocket, Met 343, Met 421, His 524 and Met 528 adopt
different packing configurations in response to each ligand (data not shown). This plasticity may
be necessary to allow the receptor to recognize endogenous estrogens such as estrone and estriol,
which differ structurally from E2 at the D ring.
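
The molecular volumes quoted in this example can be compared directly with the approximate pocket volume. The short sketch below uses only the values given in the text; because the pocket volume is itself approximate, the resulting fractions should be read as rough estimates, and the variable names are hypothetical.

```python
POCKET_VOLUME = 500.0                       # Å^3, approximate, both complexes
LIGAND_VOLUMES = {"DES": 273.0, "E2": 252.0}  # Å^3


def occupancy(ligand_volume: float, pocket_volume: float = POCKET_VOLUME) -> float:
    """Fraction of the binding pocket volume taken up by the ligand."""
    return ligand_volume / pocket_volume


fractions = {name: occupancy(vol) for name, vol in LIGAND_VOLUMES.items()}
for name, fraction in fractions.items():
    print(f"{name}: {fraction:.1%} of the pocket volume")
```

On these figures each agonist fills only about half of the pocket, which is consistent with the presence of unoccupied cavities around the bound ligand.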

Example 6
Structure of the OHT-LBD Complex

The binding of OHT induces a conformation of the LBD that differs in both secondary
and tertiary structural organization from that driven by DES binding. In the DES complex, the
main chain from residues 339 to 341, 421 to 423, and 527 to 530 forms parts of helices 3, 8 and
11, respectively. In contrast, these regions adopt an extended conformation in the OHT complex
(Figures 2A, 2B and 6A). In addition, the composition and orientation of helix 12 are different
in the two structures. Helix 12 in the DES complex consists of residues 538 to 546 whereas
helix 12 in the OHT complex consists of residues 536 to 544. Most dramatically, rather than
covering the ligand binding pocket as it does in the DES complex, helix 12 in the OHT complex
occupies the part of the coactivator binding groove formed by residues from helices 3, 4, and 5,
and the turn connecting helices 3 and 4 (Figures 2A, 2B and 3B). This alternative conformation
of helix 12 appears to be similar to that observed in the RAL complex (Brzozowski, et al.,
supra).

Example 7

Helix 12-LBD Interface

Except for the orientation of helix 12, the structure of the peptide binding groove is
almost identical in the DES and OHT complexes (Figures 3A and 3B). The region of this groove
outside of helix 12 is referred to herein as the "static region" of the NR box binding site. Helix
12 in the OHT complex and the NR box peptide helix in the DES complex interact with the static
region of the coactivator recognition groove in strikingly similar ways.

Helix 12 mimics the hydrophobic interactions of the NR box peptide with the static
region of the groove with a stretch of residues (residues 540 to 544) that resembles an NR box
(LLEML instead of LXXLL) (SEQ ID NO:3 instead of SEQ ID NO:1). The side chains of Leu
540 and Met 543 lie in approximately the same locations as those of the first and second motif
leucines (Leu 690 and Leu 693) in the peptide complex (Figure 5). Leu 540 is inserted into the
groove and makes van der Waals contacts with Leu 354, Val 376 and Glu 380 (Figures 3B and
3D). Met 543 lies along the edge of the groove and forms van der Waals contacts with the side
chains of Leu 354, Val 355 and Ile 358 (Figures 3B and 3D). The side chain position of Leu 544
almost exactly overlaps that of the third NR box leucine, Leu 694 (Figure 5). Deep within the
groove, the Leu 544 side chain makes van der Waals contacts with the side chains of Ile 358, Lys
362, Leu 372, Gln 375, Val 376 and Leu 379 (Figures 3B and 3D).
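
The resemblance between the helix 12 stretch and the NR box consensus can be checked mechanically. The following is a minimal sketch in Python, assuming the consensus is read as LXXLL with X standing for any residue; the function names and the toy sequence are illustrative only and are not part of this specification.

```python
import re

# LXXLL consensus: leucine at motif positions 1, 4 and 5, any residue at 2 and 3.
# The zero-width lookahead lets overlapping matches be reported as well.
NR_BOX_PATTERN = re.compile(r"(?=L..LL)")


def find_nr_boxes(sequence: str) -> list[int]:
    """Return the 0-based start index of every LXXLL match in a one-letter sequence."""
    return [m.start() for m in NR_BOX_PATTERN.finditer(sequence)]


def consensus_mismatches(stretch: str) -> list[int]:
    """Return the 1-based motif positions (1, 4 or 5) at which a five-residue
    stretch lacks the leucine required by LXXLL."""
    if len(stretch) != 5:
        raise ValueError("expected exactly five residues")
    return [pos for pos in (1, 4, 5) if stretch[pos - 1] != "L"]


if __name__ == "__main__":
    # Helix 12 residues 540 to 544, as given in the text.
    print(consensus_mismatches("LLEML"))
    # Hypothetical sequence containing one strict LXXLL motif.
    print(find_nr_boxes("GSLHRLLQD"))
```

For LLEML the only departure from the consensus is motif position 4, where Met 543 takes the place of the second motif leucine.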

Helix 12 in the OHT complex is also stabilized by N- and C-terminal capping
interactions. Lys 362 interacts with the C-terminal turn of helix 12 much as it does with the
equivalent turn of the peptide helix (Figures 3A and 3B). The Lys 362 side chain packs against
the C-terminal turn of helix 12 with its ε-amino group hydrogen bonding to the carbonyls of
residues 543 and 544 (Figure 5). Given that the capping interaction at the N-terminal turn of the
coactivator helix is formed by a helix 12 residue (Glu 542), the N-terminal turn of helix 12 in the
antagonist complex is forced to interact with another residue, Glu 380 (Figures 3B and 3D). The
Glu 380 γ-carboxylate forms van der Waals contacts with Tyr 537 and interacts with the amide
through a series of water-mediated hydrogen bonds (Figure 1B).

In addition to forming these "NR box-like" interactions, helix 12 also forms van der
Waals contacts with areas of the LBD outside of the coactivator recognition groove. The side
chain of Leu 536 forms van der Waals contacts with Glu 380 and Trp 383 and that of Tyr 537
forms van der Waals contacts with His 373, Val 376 and Glu 380 (Figures 1B, 3B and 3D). As a
result of these contacts, helix 12 in the OHT complex buries more solvent accessible surface area
(~1200 Å²) than the NR box peptide in the DES complex.
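
Contact lists of the kind given in this Example are commonly derived from atomic coordinates with a simple distance criterion. The sketch below, in Python, illustrates that idea under stated assumptions: the 4.0 Å cutoff is an assumed, commonly used threshold (the specification does not state which criterion was applied), and the coordinates in the usage section are hypothetical values chosen for illustration, not entries from Appendix 1.

```python
from math import dist

# (residue name, residue number, atom name, (x, y, z) in angstroms)
Atom = tuple[str, int, str, tuple[float, float, float]]


def find_contacts(group_a: list[Atom], group_b: list[Atom], cutoff: float = 4.0):
    """List atom pairs, one from each group, separated by at most `cutoff` angstroms.

    The 4.0 default is an assumption for illustration, not a value from the text.
    Pairs are returned closest first as (atom_a_label, atom_b_label, distance).
    """
    pairs = []
    for a in group_a:
        for b in group_b:
            d = dist(a[3], b[3])
            if d <= cutoff:
                pairs.append((a[:3], b[:3], round(d, 2)))
    return sorted(pairs, key=lambda pair: pair[2])


if __name__ == "__main__":
    # Hypothetical coordinates, for illustration only.
    helix12 = [("LEU", 540, "CD1", (0.0, 0.0, 0.0))]
    groove = [
        ("LEU", 354, "CD2", (3.0, 2.0, 1.0)),  # within the cutoff
        ("VAL", 376, "CG1", (6.0, 0.0, 0.0)),  # beyond the cutoff
    ]
    for atom_a, atom_b, separation in find_contacts(helix12, groove):
        print(atom_a, atom_b, separation)
```

A pairwise loop is adequate for a handful of residues; a spatial index would be the usual choice for whole-structure searches.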

Example 8

OHT Recognition

OHT is bound within the same pocket that recognizes DES, E2 and RAL. The orientation
of OHT within the binding pocket appears to be dictated by the positioning of two structural
features of this ligand, the phenolic A ring and the bulky side chain (Figures 4B and 6C). The A
ring of OHT is bound in approximately the same location as the A ring of DES near helices 3
and 6 with its phenolic hydroxyl hydrogen bonding to a structurally conserved water and to the
side chains of Glu 353 and Arg 394 (Figure 4B). Like the bulky side chain of RAL, the side
chain of OHT exits the binding pocket between helices 3 and 11 (Figures 2B and 4B). The OHT
C ring (Figure 4B) forms van der Waals contacts with the side chains of Met 343, Leu 346, Thr
347, Ala 350, Trp 383, Leu 384, Leu 387 and Leu 525. The positioning of the flexible
dimethylaminoethyl region of the side chain is stabilized by van der Waals contacts with Thr
347, Ala 350 and Trp 383 and by a salt bridge between the dimethylamino group of the side
chain and the β-carboxylate of Asp 351, which lies 3.8 Å away (Figure 4B). The positions of the
A ring and the side chain in the context of the rigid triphenylethylene framework of OHT
require that the ethylene group of OHT lie in an orientation nearly orthogonal to that of the
ethylene group of DES (Figures 4A, 4B and 6D). As a result, the B ring of OHT is driven more
deeply into the binding pocket than the A' ring of DES (Figures 6B and 6C).
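
The statement that the two ethylene groups lie nearly orthogonal to one another can be expressed as the angle between the corresponding bond vectors. The following Python sketch shows one way to compute such an angle; the helper names are illustrative, and the coordinates in the usage section are hypothetical placeholders rather than ligand coordinates from this specification.

```python
import math


def bond_vector(atom_from, atom_to):
    """Vector pointing from one atom position to another."""
    return tuple(t - f for f, t in zip(atom_from, atom_to))


def angle_between(u, v) -> float:
    """Angle in degrees between two 3-D vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        raise ValueError("zero-length vector")
    # Clamp to guard against rounding slightly outside [-1, 1].
    cosine = max(-1.0, min(1.0, dot / (norm_u * norm_v)))
    return math.degrees(math.acos(cosine))


if __name__ == "__main__":
    # Hypothetical positions of the two ethylene carbons in each ligand,
    # after superimposing the two complexes on a common frame.
    ligand_1 = bond_vector((0.0, 0.0, 0.0), (1.3, 0.0, 0.0))
    ligand_2 = bond_vector((0.1, 0.0, 0.0), (0.1, 1.3, 0.1))
    print(round(angle_between(ligand_1, ligand_2), 1))
```

Because a double bond has no intrinsic direction, an angle above 90 degrees would normally be folded back by taking 180 minus the value before comparing orientations.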

This location of the OHT B ring apparently cannot be accommodated by the same
mechanisms that allow the DES A' ring/E2 D ring end of the binding pocket to adapt to the
different structural features of DES and E2. Instead, the residues that contact the B ring (Met
343, Leu 346, Met 421, Ile 424, Gly 521, His 524 and Leu 525), most of which also interact with
the A' ring of DES, adopt conformations distinct from the ones they adopt in the DES structure
(Figure 6D). In fact, the location of the B ring actually precludes the side chain of one residue,
Met 421, from adopting the same conformation that it adopts in the DES structure (Figures 6B
and 6C). As a consequence of these B ring-induced side chain conformations, many interresidue
van der Waals contacts present in the DES complex are absent in the OHT complex. For
example, whereas Met 421 packs against His 524 from helix 11 and against Met 343 from helix
3 in the agonist complexes, it is precluded by the location of the OHT B ring from interacting
with either of these residues in the antagonist complex (Figure 6D).

The structural effects of the placement of the B ring are not limited to the residues that
contact the B ring; the conformations of these residues force other residues throughout the
binding pocket to, in turn, adopt alternative conformations. For instance, the conformation
adopted by Met 421 in the OHT complex prevents the side chains of Phe 404 and Phe 425 from
occupying the positions they take in the DES complex (Figures 6B and 6C). As a consequence,
Phe 404 does not make van der Waals contacts with the OHT A ring as it does with the A rings
of DES or E2 (Figure 6C). In fact, Phe 404 only contacts the ethyl group of OHT (Figures 6C
and 6D). The alternative conformations of the side chains of both the residues that directly
contact the B ring and those that are indirectly affected by it force the main chain throughout the
binding pocket to adopt a different conformation as well (Figure 6D).

Identification and characterization of key residues within the ligand binding domain of
ERα and extension of this information to other nuclear receptors show that these residues are
common to all nuclear receptors identified to date. Thus, the Examples presented herein
demonstrate that information derived from the structure and function of the ERα ligand binding
domain can be applied in the design and selection of compounds that modulate binding of
compounds to nuclear receptors for all members of the nuclear receptor family.

All publications and patent applications mentioned in this specification are herein
incorporated by reference to the same extent as if each individual publication or patent
application was specifically and individually indicated to be incorporated by reference.

The invention now being fully described, it will be apparent to one of ordinary skill in the 
art that many changes and modifications can be made thereto without departing from the spirit or 
scope of the appended claims. 
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Appendix 1 

Atomic Coordinates for Human ERα Complexed With DES and a GRIP1 NR Box II Peptide
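
The records in this appendix follow the familiar ATOM layout: serial number, atom name, residue name, chain, residue number, x, y, z, occupancy and temperature factor. As a reading aid, the following Python sketch parses one such record and measures an interatomic distance. It assumes whitespace-separated records with exactly eleven fields; `AtomRecord` and `parse_atom_line` are illustrative names, and the two sample records pair labels and coordinate values from this listing on the assumption that they belong together.

```python
from math import dist
from typing import NamedTuple


class AtomRecord(NamedTuple):
    serial: int
    name: str
    res_name: str
    chain: str
    res_seq: int
    xyz: tuple[float, float, float]
    occupancy: float
    b_factor: float


def parse_atom_line(line: str) -> AtomRecord:
    """Parse one whitespace-separated ATOM record with exactly 11 fields.

    Incomplete or damaged records raise ValueError instead of being guessed at.
    An alternate-location prefix fused to the residue name (e.g. "ACYS") is
    kept as part of res_name by this simple sketch.
    """
    fields = line.split()
    if len(fields) != 11 or fields[0] != "ATOM":
        raise ValueError(f"not a complete ATOM record: {line!r}")
    return AtomRecord(
        serial=int(fields[1]),
        name=fields[2],
        res_name=fields[3],
        chain=fields[4],
        res_seq=int(fields[5]),
        xyz=(float(fields[6]), float(fields[7]), float(fields[8])),
        occupancy=float(fields[9]),
        b_factor=float(fields[10]),
    )


if __name__ == "__main__":
    n = parse_atom_line("ATOM 750 N PRO A 399 12.283 17.427 -4.171 1.00 39.19")
    ca = parse_atom_line("ATOM 752 CA PRO A 399 11.243 17.709 -3.173 1.00 37.10")
    # N-CA separation in angstroms; a value near 1.47 is typical of this bond.
    print(round(dist(n.xyz, ca.xyz), 2))
```

A bonded-distance check of this kind is a quick way to confirm that a label and a coordinate triple have been matched correctly.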
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8.335 

6.339 

6.833 

8.323 

9.044 

9.030 

8.414 

9.435 

7.897 

8.531 



-0.435 
0.910 
0.048 
-0.691 
-0.928 
-0.064 
0.331 
1.299 
0.578 
1.889 
3.080 
-0.922 
-0.970 
-1.943 
-3.183 
-3.819 
-4.318 
-3.787 
-4.343 
-4. 1 94 
-4.342 
-5.725 
-4.148 
-3.620 
-4.243 
-2.295 
-1.469 
0.042 
0.897 
0.414 
-1.754 
-1.832 
-1.924 
-2.204 
-2.199 
-2.857 
-2.339 
-3.302 
-1.154 
-4.077 
-4.351 
-3.120 
-0.971 
-1.950 
-3.547 
-3.650 
-4.568 
-5.890 



1.00 

1.00 

1. 00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1. 00 

1.00 

1.00 

1.00 

1.00 

1. 00 



15.12 

18.73 

18.34 

22.62 

22.49 

21.45 

18.54 

19.35 

15.50 

21.95 

15.86 

18.67 

19.75 

20.58 

19.33 

17.48 

21.87 

19.17 

21.50 

18.25 

21.98 

21.39 

17.15 

23.27 

23.60 

23.04 

24.42 

27.38 

30.37 

23.48 

27.55 

27.50 

23.65 

24.76 

22.86 

25.90 

28.09 

29.94 

30.20 

26.55 

30.64 

32.43 

34.58 

36.54 

25.02 

20:97 

23.06 

25.97 
ATOM    708  CB   ARG A 394      15.776   7.633  -6.965  1.00  24.05
ATOM    709  CG   ARG A 394      16.243   6.195  -7.024  1.00  26.05
ATOM    710  CD   ARG A 394      15.830   5.551  -8.352  1.00  22. 7U
ATOM    711  NE   ARG A 394      14.443   5.071  -8.363  1.00  20. / 1
ATOM    712  CZ   ARG A 394      14.053   3.912  -7.841  1.00  21.26
ATOM    713  NH1  ARG A 394      14.944   3.108   '1261  1.00  20.09
ATOM    714  NH2  ARG A 394      12.783   3.544  -7.907  1.00  21.26
ATOM    715  C    ARG A 394      15.622   9.833  -D.879  1.00  2^.40
ATOM    716  O    ARG A 394      15.889  10.729  -6.611  1.00  28.61
ATOM    717  N    SER A 395      14.638   9.924  -4.988  1.00  26.63
ATOM    718  CA   SER A 395      13.776  11.104  -4.902  1.00  27 Ao
ATOM    719  CB   SER A 395      12.395  10.696  -4.382  1.00  26.70
ATOM    720  OG   SER A 395      11.916   9.530 -:).029  1.00  22.95
ATOM    721  C    SER A 395      14.316  12.240  -4.033  1.00  J 1.45
ATOM    722  O    SER A 395      13.726  13.324  -3.977  1.00  28.11
ATOM    723  N    MET A 396      15.437  11.986  -3.368  1.00  j>3.83
ATOM    724  CA   MET A 396      16.061  12.954  -2.475  1.00  38.83
ATOM    725  CB   MET A 396      17.466  12.483  -2.112  1.00  39.47
ATOM    726  CG   MET A 396      17.585  11.919  -0.715  1.00  41.37
ATOM    727  SD   MET A 396      19.192  12.262   0.004  1.00  42.20
ATOM    728  CE   MET A 396      20.263  11.996  -1.404  1.00  42.84
ATOM    729  C    MET A 396      16.143  14.376  -3.018  1.00  40.69
ATOM    730  O    MET A 396      15.637  15.316  -2.403  1.00  38.85
ATOM    731  N    GLU A 397      16.794  14.526  -4.166  1.00  42.19
ATOM    732  CA   GLU A 397      16.971  15.831  -4.790  1.00  44.80
ATOM    733  CB   GLU A 397      18.184  15.785  -5.729  1.00  46.02
ATOM    734  CG   GLU A 397      17.883  15.189  -7.096  1.00  54A2
ATOM    735  CD   GLU A 397      19.117  14.665  -7.810  1.00  59.40
ATOM    736  OE1  GLU A 397      19.219  13.430  -7.990  1.00  60.63
ATOM    737  OE2  GLU A 397      19.980  15.485  -8.196  1.00  62.71
ATOM    738  C    GLU A 397      15.735  16.322  -5.554  1.00  42.94
ATOM    739  O    GLU A 397      15.830  17.229  -6.376  1.00  44.68
ATOM    740  N    HIS A 398      14.579  15.728  -5.280  1.00  40.82
ATOM    741  CA   HIS A 398      13.342  16.118  -5.950  1.00  39.21
ATOM    742  CB   HIS A 398      12.924  15.043  -6.956  1.00  39.05
ATOM    743  CG   HIS A 398      13.870  14.886  -8.104  1.00  41.57
ATOM    744  CD2  HIS A 398      13.904  15.484  -9.318  1.00  39.28
ATOM    745  ND1  HIS A 398      14.940  14.017  -8.074  1.00  41.85
ATOM    746  CE1  HIS A 398      15.592  14.086  -9.220  1.00  40.88
ATOM    747  NE2  HIS A 398      14.985  14.969  -9.993  1.00  42.30
ATOM    748  C    HIS A 398      12.216  16.332  -4.944  1.00  37.04
ATOM    749  O    HIS A 398      11.282  15.535  -4.864  1.00  36.51
ATOM    750  N    PRO A 399      12.283  17.427  -4.171  1.00  39.19
ATOM    751  CD   PRO A 399      13.328  18.467  -4.198  1.00  35.36
ATOM    752  CA   PRO A 399      11.243  17.709  -3.173  1.00  37.10
ATOM    753  CB   PRO A 399      11.603  19.101  -2.654  1.00  37.86
ATOM    754  CG   PRO A 399      13.050  19.267  -2.963  1.00  35.83
ATOM    755  C    PRO A 399       9.828  17.663  -3.744  1.00  37.02




WO 99/50658 PCT/US99/06937





ATOM 


756 


0 


PRO 


A 


399 


9.554 


18.249 


-4.789 


1.00 


38 S'' 




ATOM 


757 


N 


GLY 


A 


400 


8.938 


16.954 


-3.057 


1.00 


33 58 




ATOM 


758 


CA 


GLY 


A 


400 


7.559 


16.865 


-3.503 


1.00 


32.12 




ATOM 


759 


c 


GLY 


A 


400 


7.230 


15.706 


-4.428 


1.00 


32.43 

^y **• * V «y 


5 


ATOM 


760 


0 


GLY 


A 


400 


6.063 


15.344 


-4.574 


1.00 


33.21 




ATOM 


761 


N 


LYS 


A 


401 


8.237 


15.1 12 


-5.055 


1.00 


31.35 

«y • * ^y ^y 




ATOM 


762 


CA 


LYS 


A 


401 


7.972 


14.007 


-5.966 


1.00 


30 75 

^ w 4 9 ^y 




ATOM 


763 


CB 


LYS 


A 

4 * 


401 


8.235 


14.430 


-7.415 


1. 00 


35 43 

^y m T ^y 




ATOM 


764 


CG 


LYS 


/\ 


401 


8.130 


15.927 


-7.675 


1.00 


35 15 


10 


ATOM 


765 


CD 


LYS 


A 


401 


9.096 


16.353 


-8.774 


1.00 


36 88 




ATOM 


766 


CE 


LYS 


A 


401 


8.733 


17.721 


-9.331 


1 00 

4 • \y vy 


36 71 

V# . ^ 1 




ATOM 


767 

f x^ * 


NZ 


LYS 


A 


401 


7 295 


18.027 


-9 1 16 


1 00 

4 « vy ^/ 


34 22 




ATOM 


768 

■ xr 


c 


LYS 


A 


401 


8 768 


12.746 


-5.611 


1 00 


30 97 

*y f 




ATOM 


769 


0 


LYS 


\ 


401 


9 809 


P 776 


-"^ 006 


1 00 


">1 60 


15 


ATOM 


770 


N 


LEU 


A 


402 


8 256 


1 1 635 


-6 197 


1 00 


'^7 28 

mm 1 .XO 




ATOM 


771 


CA 


LEU 


A 


402 


8 889 


10 334 


-6 050 


1 00 


07 
— ~.\/ / 




ATOM 


772 


CB 


LEU 


A 


402 


7 866 


9 '^94 


-"^ 590 


1 00 






ATOM 


773 


CG 


LEU 


A 


402 


1 265 


9 555 


-4 '^07 


\ 00 






ATOM 


774 


CDl 


LEU 


A 


402 


6 126 


8 583 


-3 937 

^ • y mJ / 


1 00 

1 * W V/ 


i y »J ^ 


20 


ATOM 


775 


CD2 


LEU 


A 
* k 


402 


8 355 


9 416 

X . r I vy 


-3 157 


1 00 

1 •\J\J 






ATOM 


116 


c 


LEU 

i—/ 


A 


402 


9 448 


9 948 


-7 414 


1 00 


28 78 




ATOM 


111 

w 9 i 


o 


LEU 


A 


402 


8 704 


9 836 


-8 389 


1 00 


'^9 98 

^y* yO 




ATOM 


11% 


N 


LEU 


A 


403 


10 761 


9 770 


-7 487 


1 00 






ATOM 


119 


CA 


LEU 


A 


403 


1 1 393 


9 400 


-8 744 


1 00 

1 • w 


27 17 


25 


ATOM 


780 


CB 


LEU 

1^ V-^ 


A 


403 


P 825 


9 937 


-8 816 


1 00 


26 9S 

^V/. 7i./ 




ATOM 


781 


CG 


LEU 


A 


403 


13 401 


10 027 


-10 "^38 

1 \y • mm^ \J 


1 00 


30 42 




ATOM 


782 


CDl 


LEU 


A 


403 


14 519 


1 1 046 


-10 288 


1 00 

1 . \J\J 


30 76 




ATOM 


783 


CD2 


LEU 


A 


403 


13 915 


8 665 


-10 676 


1 00 


33 1 1 




ATOM 


784 


c 


LEU 


A 


403 


11419 


1 891 


-8 901 


1 00 

1 . w 


Od 78 


30 


ATOM 


785 


o 


LEU 


A 


403 


12 428 


7 '^57 


-8 619 


1 00 


'?4 68 




ATOM 


786 


N 


PHE 


A 
i k 


404 


10 306 


7 319 


-9 344 


1 00 


'^3 1 1 




ATOM 


1%1 


CA 


PHE 


A 


404 


10 239 


5 881 


-9 546 


1 00 


^6 93 




ATOM 


788 


CB 


PHE 
1 11 i~i 


A 


404 


8 826 


5 470 


-9 946 


1 00 

1 .V/V/ 


^7 04 

^ / • V/ • 




ATOM 


789 


CG 


PHE 


A 


404 


7 850 


5 513 


-8 816 

vy • vy A 


1 00 


^^7 89 

m^ 1 »\j y 


35 


ATOM 


790 

' ^ x^ 


CDl 


PHE 


A 


404 


7.028 


6.623 


-8.631 


1 00 

4 • vy vy 


'^6 20 




ATOM 


791 


CD2 


PHE 


A 


404 


7.750 


4.444 


-7.925 


1 00 

& * vy vy 


23 10 




ATOM 


792 


CEl 


PHE 


A 


404 


6.1 16 


6.668 


-7.573 


1 00 

* * vy vy 


25 29 




ATOM 


793 


CE2 


PHE 


A 


404 


6.845 


4.481 


-6.870 


1.00 


21.01 

A * vy * 




ATOM 


794 


CZ 


PHE 


A 


404 


6.026 


5.595 


-6.693 


1.00 


22.91 


40 


ATOM 


795 


C 


PHE 


A 


404 


1 1.232 


5.507 


-10.637 


1.00 


26.04 

fcrf vy i vy ■ 




ATOM 


796 


0 


PHE 


A 


404 


1 1.882 


4.464 


-10.578 


1.00 


21.21 




ATOM 


797 


N 


ALA 


A 


405 


1 1 348 


6 383 


-1 1 .6'^6 

4*4 xy Vy 


1 00 

1 . \J\J 


'^8 80 

M (J . UV/ 




ATOM 


798 


CA 


ALA 


A 


405 


12.271 


6.195 


-12.740 


1.00 


29.21 




ATOM 


799 


CB 


ALA 


A 


405 


11.650 


5.287 


-13.806 


1.00 


26.89 


45 


ATOM 


800 


C 


ALA 


A 


405 


12.549 


7.578 


-13.317 


1.00 


30.23 




ATOM 


801 


O 


ALA 


A 


405 


11.770 


8.508 


-13.109 


1.00 


27.38 




ATOM 


802 


N 


PRO 


A 


406 


13.672 


7.737 


-14.032 


1.00 


30.05 




ATOM 


803 


CD 


PRO 


A 


406 


14.712 


6.745 


-14.352 


1.00 


26.31 








ATOM    804  CA   PRO A 406      13.977   9.053 -14.604  1.00  32.10
ATOM    805  CB   PRO A 406      15.232   8.800 -15.438  1.00  31.28
ATOM    806  CG   PRO A 406      15.865   7.602 -14.776  1.00  31.44
ATOM    807  C    PRO A 406      12.820   9.589 -15.436  1.00  32.58
ATOM    808  O    PRO A 406      12.605  10.796 -15.507  1.00  32.58
ATOM    809  N    ASN A 407      12.063   8.690 -16.053  1.00  32.86
ATOM    810  CA   ASN A 407      10.935   9.119 -16.865  1.00  32.78
ATOM    811  CB   ASN A 407      10.950   8.418 -18.228  1.00  34.73
ATOM    812  CG   ASN A 407      10.884   6.907 -18.121  1.00  -)5.j7
ATOM    813  OD1  ASN A 407      11.189   6.317 -17.077  1.00  30.24
ATOM    814  ND2  ASN A 407      10.486   6.268 -19.215  1.00  34.08
ATOM    815  C    ASN A 407       9.605   8.901 -16.166  1.00  34.90
ATOM    816  O    ASN A 407       8.549   8.897 -16.798  1.00  36.09
ATOM    817  N    LEU A 408       9.660   8.724 -14.851  1.00  33.56
ATOM    818  CA   LEU A 408       8.452   8.544 -14.061  1.00  35.59
ATOM    819  CB   LEU A 408       8.141   7.062 -13.851  1.00  33.81
ATOM    820  CG   LEU A 408       6.696   6.823 -13.397  1.00  36.44
ATOM    821  CD1  LEU A 408       5.746   7.479 -14.390  1.00  34.14
ATOM    822  CD2  LEU A 408       6.406   5.334 -13.287  1.00  32.96
ATOM    823  C    LEU A 408       8.607   9.245 -12.717  1.00  38.03
ATOM    824  O    LEU A 408       8.880   8.614 -11.695  1.00  36.38
ATOM    825  N    LEU A 409       8.441  10.563 -12.741  1.00  37.87
ATOM    826  CA   LEU A 409       8.548  11.395 -11.553  1.00  37.95
ATOM    827  CB   LEU A 409       9.373  12.636 -11.877
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10.023 
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1.00 
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45.60 


CEl 
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A 


476 


21.207 


-11. 928 


15.177 


1.00 


47.56 


NE2 


HIS 
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476 


21.091 
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15.21 1 


1.00 


46.21 
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ATOM 

ATOM 

ATOM 

ATOM 

ATOM 
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ATOM 

ATOM 
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ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 
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1 J J J 
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1336 
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1345 
1346 
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1348 
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1352 
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1360 
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1367 
1368 
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1372 
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1375 
1376 
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1378 
1379 



C 
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NH2 
C 
O 
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CB 
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CG2 
C 
O 
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CA 
CB 
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C 
O 
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CB 
CG 
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OD2 
C 
O 
N 
CA 
CB 
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CD 
CE 
NZ 
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O 
N 
CA 
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HIS 
HIS 
ARG 
ARG 
ARG 
ARG 
ARG 
ARG 
ARG 
ARG 
ARG 
ARG 
ARG 
VAL 
VAL 
VAL 
VAL 
VAL 
VAL 
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LEU 
LEU 
LEU 
LEU 
LEU 
LEU 
LEU 
LEU 
ASP 
ASP 
ASP 
ASP 
ASP 
ASP 
ASP 
ASP 
LYS 
LYS 
LYS 
LYS 
LYS 
LYS 
LYS 
LYS 
LYS 
ILE 
ILE 
ILE 



A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
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A 
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476 

476 

477 

477 

477 

477 

477 

477 

477 

477 

477 

477 

477 

478 

478 

478 

478 

478 

478 

478 

479 
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479 

479 

479 

479 
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479 

480 

480 

480 

480 

480 

480 

480 

480 

481 

481 

481 

481 

481 

481 

481 

481 

481 

482 

482 

482 



26.438 

26.634 

27.420 

28.796 

29.757 

29.800 

30.782 

3 1 .780 

32.780 

32.918 

33.643 

28.906 

29.462 

28.369 

28.389 

27.658 

27.672 

28.319 

27.689 

28.216 

26.499 

25.727 

24.474 

23.211 

22.056 

22.864 

26.592 

26.595 

27.324 

28.206 

28.878 

27.990 

28.355 

26.935 

29.283 

29.672 

29.767 

30.794 

31.306 

32.158 

32.894 

33.883 

34.954 

30.260 

30.979 

28.996 

28.421 

27.066 



-9.966 
-8.774 
-10.805 
-10.331 
-1 1.506 
-12.459 
-13.599 
-13.675 
-12.811 
-11.803 
-12.955 
-9.361 
-8.268 
-9.766 
-8.930 
-9.605 
-8.678 

-10.933 
-7.610 
-6.536 
-7.702 
-6.516 
-6.912 
-7.229 
-7.503 
-6.063 
-5.582 
-4.370 
-6.158 
-5.388 
-6.305 
-6.602 
-7.505 
-5.944 
-4.699 

-3.562 

-5.394 

-4.830 

-5.890 

-6.953 

-7.799 

-6.963 

-6.388 

-3.635 

-2.657 

-3.705 

-2.598 

-2.983 



3.170 
3.415 
2.862 
2.795 
2.605 
3.788 
3.557 
4.622 
4.770 
3.918 
5.766 
1.621 
1.753 
0.475 
9.280 
8.100 
6.890 
7.761 
9.584 
9.294 
10.171 
10.530 
11.324 
10.517 
11.481 
9.584 
11.369 
11.158 
12.320 
13.193 
14.222 
15.417 
16.198 
15.580 
12.361 
12.636 
1 1 .340 
10.477 
9.512 
10.188 
9.157 
8.350 
9.215 
9.696 
9.463 
9.291 
8.545 
7.915 



1.00 

1. 00 

1.00 

1.00 

1.00 

1. 00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1. 00 

1.00 

1.00 

1. 00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 



35.40 

35.45 

34.07 

34.18 

41.04 

47.61 

55.67 

60.17 

61.98 

64.29 

62.79 

30.77 

33.59 

27.65 

27.07 

28.00 

25.83 

31.66 

26.92 

26.97 

25.74 

27.97 

25.55 

29.01 

27.05 

24.92 

25.39 

27.39 

26.04 

27.32 

26.67 

31.02 

31.50 

32.21 

25.59 

27.15 

25.17 

24.93 

28.42 

35.59 

41.21 

41.48 

43.22 

26.12 

23.73 

25.44 

27.69 

27.59 
ATOM 1380 CG2 ILE A 482 26.470
ATOM 1381 CG1 ILE A 482 27.274
ATOM 1382 CD1 ILE A 482 26.000
ATOM 1383 C ILE A 482 28.253
ATOM 1384 O ILE A 482 28.312
ATOM 1385 N THR A 483 28.046
ATOM 1386 CA THR A 483 27.905
ATOM 1387 CB THR A 483 27.535
ATOM 1388 OG1 THR A 483 26.181
ATOM 1389 CG2 THR A 483 27.673
ATOM 1390 C THR A 483 29.257
ATOM 1391 O THR A 483 29.331
ATOM 1392 N ASP A 484 30.324
ATOM 1393 CA ASP A 484 31.674
ATOM 1394 CB ASP A 484 32.718
ATOM 1395 CG ASP A 484 32.629
ATOM 1396 OD1 ASP A 484 32.002
ATOM 1397 OD2 ASP A 484 33.185
ATOM 1398 C ASP A 484 31.930
ATOM 1399 O ASP A 484 32.481
ATOM 1400 N THR A 485 31.505
ATOM 1401 CA THR A 485 31.689
ATOM 1402 CB THR A 485 31.124
ATOM 1403 OG1 THR A 485 31.753
ATOM 1404 CG2 THR A 485 31.381
ATOM 1405 C THR A 485 30.994
ATOM 1406 O THR A 485 31.583
ATOM 1407 N LEU A 486 29.743
ATOM 1408 CA LEU A 486 28.973
ATOM 1409 CB LEU A 486 27.567
ATOM 1410 CG LEU A 486 26.508
ATOM 1411 CD1 LEU A 486 25.210
ATOM 1412 CD2 LEU A 486 26.309
ATOM 1413 C LEU A 486 29.662
ATOM 1414 O LEU A 486 29.745
ATOM 1415 N ILE A 487 30.151
ATOM 1416 CA ILE A 487 30.843
ATOM 1417 CB ILE A 487 31.203
ATOM 1418 CG2 ILE A 487 32.255
ATOM 1419 CG1 ILE A 487 29.937
ATOM 1420 CD1 ILE A 487 29.237
ATOM 1421 C ILE A 487 32.125
ATOM 1422 O ILE A 487 32.497
ATOM 1423 N HIS A 488 32.791
ATOM 1424 CA HIS A 488 34.031
ATOM 1425 CB HIS A 488 34.585
ATOM 1426 CG HIS A 488 35.799
ATOM 1427 CD2 HIS A 488 35.970
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492 
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37.034 

37.913 

37.293 

33.799 

34.577 

32.721 

32.384 

31.145 

31.310 

29.945 

32.183 

32.124 

32.587 

31.387 

31.056 

30.000 

28.607 

27.457 

26.321 

32.287 

32.412 

33.184 

34.407 

35.168 

35.275 

35.865 

35.339 

36.122 

36.136 

37.490 

37.390 

38.631 

38.357 

35.534 

36.227 

34.254 

33.590 

32.097 

33.816 

33.277 

34.604 

34.903 

33.857 

33.916 

32.905 

31.876 

30.713 

29.540 



4.239 
4.466 
4.381 
6.051 
7.004 
5.958 
6.992 
6.587 
5.353 
4.856 
5.701 
8.320 
9.365 
8.274 
9.482 
9.161 
8.940 
8.247 
7.408 
10.108 
11.330 
9.262 
9.730 
8.554 
1 0.445 
11.487 
9.876 
1 0.440 
9.477 
8.840 
7.830 
6.937 
5.577 
11.780 
12.604 
11.992 
13.238 
13.001 
14.305 
15.410 
13.960 
14.904 
15.060 
16.01 1 
14.138 
14.248 
13.304 
13.90! 



8.946 
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S.223 
7.258 
6.464 
5.574 
5.125 
4.378 
7.954 
7.507 
9.058 
9.801 
10.862 
10.289 
1 1 .496 
10.418 
10.455 
10.517 
10.949 
11.585 
12.185 
10.550 
10.838 
9.347 
8.248 
7.052 
6.744 
5.595 
5.518 
4.948 
7.809 
7.215 
8.100 
7.728 
7.528 
8.796 
8.707 
9.811 
10.873 
11.965 
12.747 
12.043 
13.068 
12.769 
1 1.988 
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1.00 
1.00 
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1.00 
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1.00 
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1.00 
1.00 
1.00 
1.00 
1.00 
1.00 



43.13 
43.40 
45.63 
32.74 
31.06 
33.56 
30.78 
34.67 
34.73 
33.21 
35.92 
33.97 
33.22 
31.33 
30.61 
32.34 
30.71 
31.14 
30.36 
32.22 
28.25 
33.81 
39.92 
37.22 
42.68 
45.32 
44.39 
44.80 
46.96 
47.20 
45.71 
45.55 
36.28 
45.61 
46.18 
43.75 
42.42 
40.92 
41.78 
40.76 
41.01 
41.63 
41.18 
38.22 
39.53 
38.91 
39.20 
40.73 
ATOM   1476  CD1  LEU A 495      29.976  14.170  10.553  1.00  37.80
ATOM   1477  CD2  LEU A 495      28.349  12.943  12.026  1.00  40.94
ATOM   1478  C    LEU A 495      32.461  13.923  14.431  1.00  36.01
ATOM   1479  O    LEU A 495      33.347  13.074  14.544  1.00  34.85
ATOM   1480  N    THR A 496      31.979  14.604  15.459  1.00  37.52
ATOM   1481  CA   THR A 496      32.462  14.350  16.812  1.00  35.45
ATOM   1482  CB   THR A 496      31.925  15.375  17.829  1.00  37.55
ATOM   1483  OG1  THR A 496      30.498  15.263  17.908  1.00  32.93
ATOM   1484  CG2  THR A 496      32.315  16.797  17.434  1.00  36.16
ATOM   1485  C    THR A 496      31.933  12.987  17.210  1.00  35.67
ATOM   1486  O    THR A 496      31.081  12.427  16.521  1.00  34.34
ATOM   1487  N    LEU A 497      32.429  12.452  18.319  1.00  34.88
ATOM   1488  CA   LEU A 497      31.965  11.151  18.786  1.00  35.67
ATOM   1489  CB   LEU A 497      32.689  10.760  20.074  1.00  41.10
ATOM   1490  CG   LEU A 497      33.714   9.640  19.896  1.00  45.27
ATOM   1491  CD1  LEU A 497      34.755   9.692  21.008  1.00  45.09
ATOM   1492  CD2  LEU A 497      32.988   8.305  19.884  1.00  47.77
ATOM   1493  C    LEU A 497      30.455  11.198  19.026  1.00  33.72
ATOM   1494  O    LEU A 497
ATOM   1495  N    GLN A 498
ATOM   1496  CA   GLN A 498
ATOM   1497  CB   GLN A 498
ATOM   1498  CG   GLN A 498
ATOM   1499  CD   GLN A 498
ATOM   1500  OE1  GLN A 498
ATOM   1501  NE2  GLN A 498
ATOM   1502  C    GLN A 498
ATOM   1503  O    GLN A 498
ATOM   1504  N    GLN A 499
ATOM   1505  CA   GLN A 499
ATOM   1506  CB   GLN A 499
ATOM   1507  CG   GLN A 499
ATOM   1508  CD   GLN A 499
ATOM   1509  OE1  GLN A 499
ATOM   1510  NE2  GLN A 499
ATOM   1511  C    GLN A 499
ATOM   1512  O    GLN A 499
ATOM   1513  N    GLN A 500
ATOM   1514  CA   GLN A 500
ATOM   1515  CB   GLN A 500
ATOM   1516  CG   GLN A 500
ATOM   1517  CD   GLN A 500
ATOM   1518  OE1  GLN A 500
ATOM   1519  NE2  GLN A 500
ATOM   1520  C    GLN A 500
ATOM   1521  O    GLN A 500
ATOM   1522  N    HIS A 501
ATOM   1523  CA   HIS A 501
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2 1 .686 
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15.176 


23.008 


1.00 


40.02 


27.776 12.476 18.773 1.00 30.47
26.682 11.927 18.665 1.00 30.85
28.311 13.196 17.793 1.00 29.52
27.603 13.362 16.524 1.00 30.24
28.292 14.420 15.661 1.00 30.20
28.135 15.840 16.191 1.00 31.60
28.930 16.849 15.389 1.00 31.61
29.956 16.518 14.795 1.00 30.66
28.457 18.089 15.364 1.00 34.17
27.529 12.047 15.753 1.00 29.40
26.567 11.793 15.032 1.00 30.04
28.550 11.214 15.903 1.00 25.67
28.577 9.937 15.216 1.00 29.30
29.933 9.276 15.406 1.00 31.52
31.012 9.839 14.508 1.00 33.05
32.371 9.370 14.930 1.00 34.84
32.612 8.194 15.141 1.00 36.47
33.301 10.324 15.082 1.00 38.25
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B 
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Q AQ1 
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2504 


CB 


LEU 


B 


378 




1 n 
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2505 
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B 


378 


d Old 
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ATOM 


2506 


CDl 


LEU 


B 


378 




1 Z* l*to 


ATOM 


2507 


CD2 


LEU 


B 

A_>> 


378 


5 004 




ATOM 


2508 


C 


LEU 


B 


378 




R All 


ATOM 


2509 


O 


LEU 

A_/ 


B 

A_/ 


378 




7 ion 


ATOM 


2510 


N 


LEU 


B 


379 


v.OOO 


o.O 1 U 


ATOM 


2511 


CA 


LEU 


B 


379 




7 4R/i 


ATOM 


2512 


CB 


LEU 

^^^^^^ 


B 


219 


-0 596 


7 080 


ATOM 


2513 


CG 


LEU 


B 


379 


0 909 




ATOM 


2514 


CDl 


LEU 


B 


379 


-0 578 


0 no9 

7.U^Z 


ATOM 


2515 


CD2 


LEU 


B 


379 


0 851 


7 075 


ATOM 


2516 


C 


LEU 


B 


379 


-0 518 


^ ^05 


ATOM 


2517 


0 


LEU 


B 


379 


-0 476 


5 177 


ATOM 


2518 


N 


GLU 


B 


380 




7 999 


ATOM 


2519 


CA 


GLU 


B 


380 


-2 086 


6 435 


ATOM 


2520 


CB 


GLU 


B 


380 


-2 994 


7 150 


ATOM 


2521 


CG 


GLU 


B 


380 


-4 182 


1 874 


ATOM 


2522 


CD 


GLU 


B 


380 


-5 070 


8 789 


ATOM 


2523 


OEl 


GLU 


B 


380 


-6 206 


9 066 


ATOM 


2524 


OE2 


GLU 


B 


380 

\ir 


-4 631 


9 230 


ATOM 


2525 


C 


GLU 


B 


380 


-1.210 


5 594 


ATOM 


2526 


0 


GLU 


B 


380 


-1.586 


4.491 


ATOM 


2527 


N 


ACYS B 


381 


-0.039 


6.113 


ATOM 


2528 


N 


BCYS B 


381 


-0.035 


6.113 


ATOM 


2529 


CA 


ACYS B 


381 


0.860 


5.384 


ATOM 


2530 


CA 


BCYS B 


381 


0.875 


5.407 


ATOM 


2531 


CB 


ACYS B 


381 


1.870 


6.342 



21.221 

20.599 

20.140 

19.717 

18.877 

20.954 

18.965 

18.938 

18.007 

16.843 

15.886 

14.667 

14.287 

13.683 

12.750 

13.093 

17.210 

16.719 

18.072 

18.474 

19.388 

18.736 

19.796 

17.638 

19.161 

18.859 

20.077 

20.795 

21.968 

23.214 

24.211 

23.842 

19.872 

19.968 

18.971 

18.049 

1 7.222 

18.007 

17.188 

1 7.625 

16.110 

17.117 

16.722 

16.772 

16.779 

15.887 

15.884 

15.248 



1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1. 00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

0.75 

0.25 

0.75 

0.25 

0.75 



23.56 

21.87 

23.44 

22.09 

24.14 

16.96 

24.15 

22.26 

25.85 

25.68 

25.27 

30.63 

31.03 

35.05 

33.12 

31.06 

24.37 

21.47 

19.24 

20.09 

18.64 

22.15 

22.51 

20.68 

18.91 

17.52 

18.25 

18.60 

16.77 

17.90 

15.84 

22.09 

20.17 

18.11 

19.40 

20.19 

22.43 

25.30 

29.44 

31.70 

31.75 

18.92 

19.83 

17.41 

17.76 

20.19 

17.50

24.20 
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ATOM 


2532 


CB 


BCYS B 


381 


1.830 


6.406 


15.226 


0.25 


16.63 




ATOM 2533 SG ACYS B 381 1.167 7.518 14.060 0.75 33.54
ATOM 2534 SG BCYS B 381 3.048 5.656 14.128 0.25 10.36
ATOM 2535 C ACYS B 381 1.626 4.269 16.592 0.75 20.59
ATOM 2536 C BCYS B 381 1.689 4.305 16.561 0.25 19.19
ATOM 2537 O ACYS B 381 1.737 3.161 16.069 0.75 19.16
ATOM 2538 O BCYS B 381 1.904 3.241 15.982 0.25 19.25




ATOM 2539 N ALA B 382 2.134 4.560 17.785 1.00 19.04
ATOM 2540 CA ALA B 382 2.955 3.602 18.530 1.00 20.27
ATOM 2541 CB ALA B 382 4.135 4.364 19.143 1.00 18.68
ATOM 2542 C ALA B 382 2.356 2.702 19.607 1.00 16.82
ATOM 2543 O ALA B 382 3.070 1.852 20.142 1.00 13.37




ATOM 2544 N TRP B 383 1.074 2.855 19.916 1.00 15.30
ATOM 2545 CA TRP B 383 0.487 2.089 21.013 1.00 15.80
ATOM 2546 CB TRP B 383 -1.009 2.410 21.160 1.00 16.63
ATOM 2547 CG TRP B 383 -1.871 1.775 20.129 1.00 19.93
ATOM 2548 CD2 TRP B 383 -2.493 0.483 20.198 1.00 20.80
ATOM 2549 CE2 TRP B 383 -3.226 0.309 19.003 1.00 19.27
ATOM 2550 CE3 TRP B 383 -2.506 -0.542 21.155 1.00 21.32
ATOM 2551 CD1 TRP B 383 -2.236 2.312 18.933 1.00 18.59
ATOM 2552 NE1 TRP B 383 -3.051 1.439 18.250 1.00 23.67
ATOM 2553 CZ2 TRP B 383 -3.963 -0.853 18.733 1.00 21.55
ATOM 2554 CZ3 TRP B 383 -3.243 -1.702 20.888 1.00 20.29
ATOM 2555 CH2 TRP B 383 -3.960 -1.844 19.686 1.00 19.03
ATOM 2556 C TRP B 383 0.701 0.579 21.020 1.00 17.35
ATOM 2557 O TRP B 383 0.982 0.010 22.077 1.00 13.92




ATOM 2558 N LEU B 384 0.568 -0.087 19.879 1.00 14.07
ATOM 2559 CA LEU B 384 0.773 -1.532 19.903 1.00 15.98
ATOM 2560 CB LEU B 384 0.181 -2.200 18.656 1.00 12.19
ATOM 2561 CG LEU B 384 0.173 -3.735 18.720 1.00 12.97
ATOM 2562 CD1 LEU B 384 -0.352 -4.240 20.089 1.00 10.65
ATOM 2563 CD2 LEU B 384 -0.707 -4.259 17.586 1.00 17.84
ATOM 2564 C LEU B 384 2.262 -1.861 20.034 1.00 14.64
ATOM 2565 O LEU B 384 2.627 -2.833 20.690 1.00 13.78


ATOM 2566 N GLU B 385 3.116 -1.046 19.414 1.00 14.96
ATOM 2567 CA GLU B 385 4.565 -1.260 19.509 1.00 13.79
ATOM 2568 CB GLU B 385 5.336 -0.179 18.739 1.00 15.34




ATOM 


2569 


CG 
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B 


385 


5.297 


-0.312 


17.207


1.00 


15.38 




ATOM 


2570 


CD 
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B 


385 


6.162


0.738 


16.520 


1.00 


23.97 


40 


ATOM 


2571 


OEl 
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B 


385 


7.381 


0.500 


16.358 


1.00 


21.03 




ATOM 2572 OE2 GLU B 385 5.622 1.808 16.149 1.00 22.19
ATOM 2573 C GLU B 385 4.963 -1.161 20.987 1.00 15.79
ATOM 2574 O GLU B 385 5.788 -1.942 21.463 1.00 15.04
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N 
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B 


386 


4.389 


-0.213 


21.690


1.00 


13.32 


45 


ATOM 


2576 


CA 
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B 


386 


4.723 


-0.019 


23.108 


1.00 


14.06 




ATOM 


2577 


CB 
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B 


386 


4.173 


1.326 


23.614 


1.00 


15.36 




ATOM 2578 CG2 ILE B 386 4.374 1.451 25.130 1.00 15.97




ATOM 


2579 


CGI 
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B 


386 


4.910 


2.476 


22.907 


1.00 


17.95 
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ATOM 
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GLY 


B 
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ATOM 


2608 
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B 
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ATOM 


2609 
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GLY 


B 


390 


ATOM 


2610 


O 


GLY 


B 
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ATOM 


2611 
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LEU 


B 
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ATOM 


2612 


CA 


LEU 


B 


391 


ATOM 2613 CB LEU B 391
ATOM 2614 CG LEU B 391
ATOM 2615 CD1 LEU B 391
ATOM 2616 CD2 LEU B 391
ATOM 2617 C LEU B 391
ATOM 2618 O LEU B 391
ATOM 2619 N VAL B 392
ATOM 2620 CA VAL B 392
ATOM 2621 CB VAL B 392
ATOM 2622 CG1 VAL B 392
ATOM 2623 CG2 VAL B 392
ATOM 2624 C VAL B 392
ATOM 2625 O VAL B 392
ATOM 2626 N TRP B 393
ATOM 2627 CA TRP B 393



PCT/US99/06937 



4.118 3.768 22.874 1.00 21.12
4.227 -1.164 23.993 1.00 14.97
4.905 -1.560 24.941 1.00 19.60
3.038 -1.675 23.709 1.00 15.18
2.516 -2.791 24.478 1.00 15.98
1.070 -3.097 24.080 1.00 17.15
-0.031 -2.113 24.486 1.00 19.65
-1.371 -2.628 23.972 1.00 17.77
-0.075 -1.966 26.002 1.00 15.38
3.391 -4.013 24.180 1.00 14.69
3.712 -4.792 25.076 1.00 14.03
3.785 -4.178 22.921 1.00 16.43
4.602 -5.329 22.547 1.00 16.67
4.673 -5.460 21.026 1.00 14.83
3.403 -6.066 20.453 1.00 13.91
3.364 -6.193 18.675 1.00 17.23
1.906 -7.225 18.511 1.00 14.97
6.004 -5.332 23.133 1.00 20.19
6.460 -6.366 23.636 1.00 21.50
6.707 -4.203 23.074 1.00 15.34
8.044 -4.209 23.634 1.00 15.59
8.836 -2.911 23.322 1.00 14.95
8.330 -1.746 24.158 1.00 12.81
10.325 -3.164 23.602 1.00 17.24
11.228 -1.972 23.357 1.00 15.65
7.950 -4.446 25.147 1.00 14.30
8.844 -5.044 25.739 1.00 18.72
6.855 -4.007 25.761 1.00 13.99
6.681 -4.219 27.189 1.00 14.87
6.444 -5.702 27.463 1.00 18.54
6.989 -6.282 28.403 1.00 16.54
5.623 -6.325 26.628 1.00 16.15
5.334 -7.743 26.775 1.00 18.91
4.332 -8.179 25.699 1.00 19.55
4.157 -9.689 25.457 1.00 20.91
3.580 -10.351 26.699 1.00 19.41
3.232 -9.913 24.268 1.00 20.70
6.649 -8.518 26.625 1.00 20.31
7.002 -9.352 27.465 1.00 18.66
7.378 -8.215 25.557 1.00 18.71
8.649 -8.868 25.278 1.00 19.51
9.288 -8.281 24.005 1.00 23.77
10.751 -8.687 23.920 1.00 24.63
8.520 -8.773 22.767 1.00 19.94
9.615 -8.707 26.450 1.00 22.80
10.336 -9.637 26.811 1.00 19.36
9.617 -7.522 27.046 1.00 22.10
10.492 -7.241 28.171 1.00 23.20
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ATOM 2628 CB TRP B 393 

ATOM 2629 CG TRP B 393
ATOM 2630 CD2 TRP B 393
ATOM 2631 CE2 TRP B 393
ATOM 2632 CE3 TRP B 393
ATOM 2633 CD1 TRP B 393
ATOM 2634 NE1 TRP B 393
ATOM 2635 CZ2 TRP B 393
ATOM 2636 CZ3 TRP B 393
ATOM 2637 CH2 TRP B 393
ATOM 2638 C TRP B 393
ATOM 2639 O TRP B 393
ATOM 2640 N ARG B 394
ATOM 2641 CA ARG B 394
ATOM 2642 CB ARG B 394
ATOM 2643 CG ARG B 394
ATOM 2644 CD ARG B 394
ATOM 2645 NE ARG B 394
ATOM 2646 CZ ARG B 394
ATOM 2647 NH1 ARG B 394
ATOM 2648 NH2 ARG B 394
ATOM 2649 C ARG B 394
ATOM 2650 O ARG B 394
ATOM 2651 N SER B 395
ATOM 2652 CA SER B 395
ATOM 2653 CB SER B 395
ATOM 2654 OG SER B 395
ATOM 2655 C SER B 395
ATOM 2656 O SER B 395
ATOM 2657 N MET B 396
ATOM 2658 CA MET B 396
ATOM 2659 CB MET B 396
ATOM 2660 CG MET B 396
ATOM 2661 SD MET B 396
ATOM 2662 CE MET B 396
ATOM 2663 C MET B 396
ATOM 2664 O MET B 396
ATOM 2665 N GLU B 397
ATOM 2666 CA GLU B 397
ATOM 2667 CB GLU B 397
ATOM 2668 CG GLU B 397
ATOM 2669 CD GLU B 397
ATOM 2670 OE1 GLU B 397
ATOM 2671 OE2 GLU B 397
ATOM 2672 C GLU B 397
ATOM 2673 O GLU B 397
ATOM 2674 N HIS B 398
ATOM 2675 CA HIS B 398
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10.388 


-5.773 


28.578 


1.00 


19.22 


11.056 -5.479 29.895 1.00 22.53
12.453 -5.591 30.193 1.00 20.36
12.624 -5.208 31.545 1.00 25.65
13.578 -5.976 29.449 1.00 22.12
10.452 -5.046 31.044 1.00 23.02
11.387 -4.881 32.037 1.00 24.91
13.876 -5.200 32.171 1.00 23.00
14.829 -5.968 30.072 1.00 23.98
14.964 -5.582 31.423 1.00 23.20
10.208 -8.114 29.388 1.00 24.36
11.128 -8.717 29.944 1.00 23.04
8.952 -8.189 29.819 1.00 21.29
8.680 -9.003 30.990 1.00 22.43
7.365 -8.601 31.667 1.00 23.97
6.259 -8.149 30.759 1.00 26.16
5.026 -7.727 31.574 1.00 20.86
3.817 -7.937 30.786 1.00 19.54
3.327 -7.059 29.915 1.00 20.58
3.944 -5.902 29.722 1.00 17.41
2.229 -7.347 29.220 1.00 16.82
8.695 -10.502 30.713 1.00 21.78
8.657 -11.294 31.648 1.00 23.44
8.767 -10.880 29.438 1.00 17.10
8.805 -12.289 29.041 1.00 25.08
8.206 -12.473 27.638 1.00 19.47
6.832 -12.136 27.619 1.00 21.73
10.239 -12.831 29.031 1.00 26.29
10.458 -14.030 28.854 1.00 23.75
11.206 -11.938 29.210 1.00 30.79
12.620 -12.307 29.205 1.00 35.07
13.479 -11.063 29.423 1.00 33.84
14.155 -10.569 28.171 1.00 36.88
15.149 -9.127 28.491 1.00 40.96
16.675 -9.849 28.998 1.00 39.67
12.983 -13.353 30.250 1.00 35.88
13.828 -14.215 30.011 1.00 34.52
12.348 -13.266 31.410 1.00 36.19
12.604 -14.206 32.492 1.00 39.24
12.153 -13.605 33.821 1.00 44.38
12.983 -12.422 34.271 1.00
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B 


420 


ATOM 2848 C GLY B 420
ATOM 2849 O GLY B 420
ATOM 2850 N MET B 421
ATOM 2851 CA MET B 421
ATOM 2852 CB MET B 421
ATOM 2853 CG MET B 421
ATOM 2854 SD MET B 421
ATOM 2855 CE MET B 421
ATOM 2856 C MET B 421
ATOM 2857 O MET B 421
ATOM 2858 N VAL B 422
ATOM 2859 CA VAL B 422
ATOM 2860 CB VAL B 422
ATOM 2861 CG1 VAL B 422
ATOM 2862 CG2 VAL B 422
ATOM 2863 C VAL B 422
ATOM 2864 O VAL B 422
ATOM 2865 N GLU B 423
ATOM 2866 CA GLU B 423
ATOM 2867 CB GLU B 423



1 



0.306 


-25.541 


17.346 


1.00 


41.85 


1.286 


-24.760 


16.542 


1.00 


45.63 


•4.321 


-21.64^ 


18.559 


1.00 


36.93 


-5.121 -21.790 17.638 1.00 38.36
-4.698 -21.430 19.817 1.00 37.10
-6.106 -21.371 20.196 1.00 36.46
-6.218 -21.226 21.717 1.00 39.01
-5.674 -22.710 22.612 1.00 43.81
-6.899 -20.277 19.491 1.00 35.19
-8.127 -20.296 19.485 1.00 33.92
-6.195 -19.316 18.906 1.00 36.04
-6.838 -18.236 18.163 1.00 34.59
-6.525 -16.850 18.775 1.00 34.87
-6.831 -15.763 17.765 1.00 35.32
-7.350 -16.630 20.036 1.00 33.65
-6.241 -18.317 16.764 1.00 34.17
-5.020 -18.323 16.611 1.00 32.73
-7.084 -18.388 15.740 1.00 33.44
-6.554 -18.500 14.390 1.00 34.52
-7.681 -18.722 13.380 1.00 36.21
-8.597 -17.538 13.166 1.00 44.19
-9.477 -17.723 11.946 1.00 48.47
-9.157 -18.605 11.119 1.00 51.04
-10.484 -16.993 11.813 1.00 48.91
-5.717 -17.289 13.997 1.00 32.89
-6.156 -16.144 14.123 1.00 31.09
-4.501 -17.562 13.535 1.00 32.84
-3.594 -16.506 13.122 1.00 34.37
-2.722 -15.955 14.240 1.00 35.30
-1.745 -15.246 13.975 1.00 35.94
-3.052 -16.285 15.486 1.00 30.08
-2.289 -15.780 16.625 1.00 29.22
-3.108 -15.922 17.914 1.00 22.54
-2.469 -15.270 19.124 1.00 23.82
-2.124 -13.494 18.872 1.00 28.40
-3.697 -12.800 19.233 1.00 24.67
-0.912 -16.416 16.821 1.00 29.67
0.022 -15.751 17.269 1.00 29.76
-0.766 -17.694 16.484 1.00 30.63
0.524 -18.338 16.675 1.00 29.90
0.482 -19.835 16.273 1.00 35.74
0.514 -19.992 14.753 1.00 37.64
1.659 -20.555 16.897 1.00 31.68
1.669 -17.640 15.935 1.00 28.64
2.788 -17.571 16.441 1.00 26.15
1.402 -17.113 14.747 1.00 28.70
2.454 -16.435 13.997 1.00 31.34
1.963 -16.050 12.596 1.00 36.21
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ATOM 2868 CG GLU B 423 

ATOM 2869 CD GLU B 423
ATOM 2870 OE1 GLU B 423
ATOM 2871 OE2 GLU B 423
ATOM 2872 C GLU B 423
ATOM 2873 O GLU B 423
ATOM 2874 N ILE B 424
ATOM 2875 CA ILE B 424
ATOM 2876 CB ILE B 424
ATOM 2877 CG2 ILE B 424
ATOM 2878 CG1 ILE B 424
ATOM 2879 CD1 ILE B 424
ATOM 2880 C ILE B 424
ATOM 2881 O ILE B 424
ATOM 2882 N PHE B 425
ATOM 2883 CA PHE B 425
ATOM 2884 CB PHE B 425
ATOM 2885 CG PHE B 425
ATOM 2886 CD1 PHE B 425
ATOM 2887 CD2 PHE B 425
ATOM 2888 CE1 PHE B 425
ATOM 2889 CE2 PHE B 425
ATOM 2890 CZ PHE B 425
ATOM 2891 C PHE B 425
ATOM 2892 O PHE B 425
ATOM 2893 N ASP B 426
ATOM 2894 CA ASP B 426
ATOM 2895 CB ASP B 426
ATOM 2896 CG ASP B 426
ATOM 2897 OD1 ASP B 426
ATOM 2898 OD2 ASP B 426
ATOM 2899 C ASP B 426
ATOM 2900 O ASP B 426
ATOM 2901 N MET B 427
ATOM 2902 CA MET B 427
ATOM 2903 CB MET B 427
ATOM 2904 CG MET B 427
ATOM 2905 SD MET B 427
ATOM 2906 CE MET B 427
ATOM 2907 C MET B 427
ATOM 2908 O MET B 427
ATOM 2909 N LEU B 428
ATOM 2910 CA LEU B 428
ATOM 2911 CB LEU B 428
ATOM 2912 CG LEU B 428
ATOM 2913 CD1 LEU B 428
ATOM 2914 CD2 LEU B 428
ATOM 2915 C LEU B 428



0.502 -16.376 12.325 1.00 45 83 

0.250 -17.865 12.144 1.00 46 71 

-0.746 -18.368 12.706 1.00 45 97 

1.045 -18.530 11.442 1.00 50 05 

2.928 -15.186 14.744 1.00 30 57 

4.119 -14.870 14.759 1.00 "^6 59 

2.001 -14.478 15.378 1.00 ^6 19 

2.381 -13.279 16.111 1.00 26 23 

1.134 -12.435 16.452 1.00 ^9 33 

1.492 -11.315 17.425 1.00 3091 

0.584 -11.817 15.160 1.00 ^9 09 

-0.895 -11.514 15.187 1.00 30 51 

3.153 -13.673 17.370 1.00 24 22 

4.152 -13.037 17.725 1.00 7] '05 

2.708 -14.746 18.023 1.00 21 71 

3.370 -15.236 19.223 1.00 18 85 

2.650 -16.479 19.768 1.00 ^2 98 

1.580 -16.183 20.795 1.00 22 17 

1.287 -17.112 21.792 1.00 ^5 47 

0.843 -15.001 20.747 1.00 ^6 30 

,0.273 -16.871 22.724 1.00 7433 

-0.174 -14.749 21.676 1.00 25 03 

-0.459 -15.684 22.663 1.00 26 44 

4.817 -15.610 18.885 1.00 20 00 

5.741 -15.292 19.636 1.00 21 15 

5.023 -16.281 17.754 1.00 19 87 

6.378 -16.685 17.377 1.00 23 20 

6.364 -17.510 16.090 1.00 :'6 53 

5.992 -18.965 16.335 1.00 34 28 

6.242 -19.467 17.455 1.00 35 24 

5.448 -19.600 15.409 1.00 31 49 

7.302 -15.489 17.198 1.00 84 

8.465 -15.526 17.593 1.00 21 55 

6.788 -14.429 16.591 1.00 ^0 12 

7.597 -13.234 16.382 1.00 21 0^ 

6.836 -12.228 15.520 1.00 18 53 

6.864 -12.559 14.038 1.00 27 9^ 

6.011 -11.341 13.024 1.00 32 84 

4.363 -11.532 13.581 1.00 33 63 

7.945 -12.616 17.732 1.00 1749 

9.073 -12.180 1 7.950 1.00 ^2 09 

6.968 -12.597 18.634 1. 00 70 47 

7.157 -12.033 19.968 1.00 ^0 13 

5.812 -11.964 20.706 1.00 17 58 

4.852 -10.887 20.179 1.00 18 41 

3.443 -11.155 20.687 1.00 11 95 

5.324 -9.505 20.631 1.00 17 80 

8.159 -12.856 20.767 1.00 ^0 68 
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ATOM 2916 O LEU B 428 

ATOM 2917 N LEU B 429
ATOM 2918 CA LEU B 429
ATOM 2919 CB LEU B 429
ATOM 2920 CG LEU B 429
ATOM 2921 CD1 LEU B 429
ATOM 2922 CD2 LEU B 429
ATOM 2923 C LEU B 429
ATOM 2924 O LEU B 429
ATOM 2925 N ALA B 430
ATOM 2926 CA ALA B 430
ATOM 2927 CB ALA B 430
ATOM 2928 C ALA B 430
ATOM 2929 O ALA B 430
ATOM 2930 N THR B 431
ATOM 2931 CA THR B 431
ATOM 2932 CB THR B 431
ATOM 2933 OG1 THR B 431
ATOM 2934 CG2 THR B 431
ATOM 2935 C THR B 431
ATOM 2936 O THR B 431
ATOM 2937 N SER B 432
ATOM 2938 CA SER B 432
ATOM 2939 CB SER B 432
ATOM 2940 OG SER B 432
ATOM 2941 C SER B 432
ATOM 2942 O SER B 432
ATOM 2943 N SER B 433
ATOM 2944 CA SER B 433
ATOM 2945 CB SER B 433
ATOM 2946 OG SER B 433
ATOM 2947 C SER B 433
ATOM 2948 O SER B 433
ATOM 2949 N ARG B 434
ATOM 2950 CA ARG B 434
ATOM 2951 CB ARG B 434
ATOM 2952 CG ARG B 434
ATOM 2953 CD ARG B 434
ATOM 2954 NE ARG B 434
ATOM 2955 CZ ARG B 434
ATOM 2956 NH1 ARG B 434
ATOM 2957 NH2 ARG B 434
ATOM 2958 C ARG B 434
ATOM 2959 O ARG B 434
ATOM 2960 N PHE B 435
ATOM 2961 CA PHE B 435
ATOM 2962 CB PHE B 435
ATOM 2963 CG PHE B 435



9.028 


-12.305 


21.445 


1.00 


AC 

20.45 


8.037 


-14.178 


20.679 


1.00 


20.35 


8.938 


-15.082 


21.382 


1.00 


19.82 


8.470 


-16.532 


21.21 1 


1. 00 


23.13 


7.189 


-16.839 


21.997 


1.00 


21.85 


6.551 


-18.123 


21.494 


1.00 


25.39 


7.537 


- 1 6.944 


23.475 


1.00 


24.91 


10.361 


-14.936 


20.865 


1.00 


20.74 


11.318 


-14.968 


21.638 


1.00 


21.02 


10.495 


-14.770 


19,554 


1.00 


21.40 


11.808 


-14.609 


18.947 


1.00 


22.77 


11.677 


-14.596 


17.432 


1.00 


21.1 1 


12.467 


-13.315 


19.440 


1.00 


22.40 


13.670 


-13.277 


19.713 


1.00 


20.62 


1 1 .670 


-12.258 


19.567 


1.00 


21.09 


12.183 


-10.974 


20.021 


1.00 


22,67 


11.128 


-9.866 


19.863 


1. 00 


23.77 


10.572 


-9.936 


18.547 


1.00 


23.84 


1 1 .762 


-8.489 


20.073 


1.00 


21.78 


12.603 


-11.037 


2 1 .480 


1.00 


21,98 


13.595 


-10.429 


21.879 


1.00 


19.85 


11.844 


- 1 1 .773 


22.280 


1.00 


24.24 


12.169 


-11.906 


23.693 


1.00 


26.96 


11.055 


-12.661 


24.423 


1.00 


28.00 


1 1 .404 


-12.888 


25.116 


1.00 


30.31 


13.491 


-12.660 


23.820 


1.00 


27.67 


14.305 


-12.377 


24.701 


1.00 


23.78 



13.691 -13.628 22.932 1.00 29.27 

14.914 -14.421 22.928 1.00 31.96 

14.790 -15.575 21.938 1.00 30.84 

14.761 -16.808 22.625 1.00 38.26 

16.104 -13.550 22.548 1.00 31.47 

17.204 -13.701 23.087 1.00 28.43 

15.878 -12.641 21.607 1.00 29.55 



16.926 


-11.739 


21.165 


1.00 


29.40 


16.437 


-10.912 


19.977 


1.00 


31.56 


17.428 


-9.868 


19.493 


1.00 


36.76 


18.694 


-10.502 


18.919 


1.00 


37.76 


19.654 


-9.479 


18.516 


1.00 


39.50 


20.965 


-9.673 


18.418 


1.00 


44.17 


21.492 


-10.861 


1 8.696 


1.00 


43.17 


21.750 


-8.671 


18.048 


1.00 


43.05 


17.328 


-10.827 


22.326 


1.00 


29.15 


18.515 


-10.612 


22.569 


1.00 


28.82 


16.337 


-10.297 


23.039 


1.00 


24.88 


16.600 


-9.422 


24.186 


1.00 


25.74 


15.278 


-8.972 


24.825 


1.00 


26.53 


14.656 


-7.758 


24.183 


1.00 


30.94 
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ATOM 2964 CD] PHE B 435 
ATOM 2965 CD2 PHE B 435
ATOM 2966 CE1 PHE B 435
ATOM 2967 CE2 PHE B 435
ATOM 2968 CZ PHE B 435
ATOM 2969 C PHE B 435
ATOM 2970 O PHE B 435
ATOM 2971 N ARG B 436
ATOM 2972 CA ARG B 436
ATOM 2973 CB ARG B 436
ATOM 2974 CG ARG B 436
ATOM 2975 CD ARG B 436
ATOM 2976 NE ARG B 436
ATOM 2977 CZ ARG B 436
ATOM 2978 NH1 ARG B 436
ATOM 2979 NH2 ARG B 436
ATOM 2980 C ARG B 436
ATOM 2981 O ARG B 436
ATOM 2982 N AMET B 437
ATOM 2983 N BMET B 437
ATOM 2984 CA AMET B 437
ATOM 2985 CA BMET B 437
ATOM 2986 CB AMET B 437
ATOM 2987 CB BMET B 437
ATOM 2988 CG AMET B 437
ATOM 2989 CG BMET B 437
ATOM 2990 SD AMET B 437
ATOM 2991 SD BMET B 437
ATOM 2992 CE AMET B 437
ATOM 2993 CE BMET B 437
ATOM 2994 C AMET B 437
ATOM 2995 C BMET B 437
ATOM 2996 O AMET B 437
ATOM 2997 O BMET B 437
ATOM 2998 N MET B 438
ATOM 2999 CA MET B 438
ATOM 3000 CB MET B 438
ATOM 3001 CG MET B 438
ATOM 3002 SD MET B 438
ATOM 3003 CE MET B 438
ATOM 3004 C MET B 438
ATOM 3005 O MET B 438
ATOM 3006 N ASN B 439
ATOM 3007 CA ASN B 439
ATOM 3008 CB ASN B 439
ATOM 3009 CG ASN B 439
ATOM 3010 OD1 ASN B 439
ATOM 3011 ND2 ASN B 439



15.118 -7.271 22.966 1.00 3? 65 
13.592 -7.108 24.797 1.00 33 60 
14.529 -6.155 22.372 1.00 36 84 
12.997 -5.989 24.208 1.00 34 96 
13.468 -5.516 22.995 1.00 3164 

17.426 -10.184 25.233 1.00 ^539 
18.414 -9.675 25.764 j 00 ->-> 59 

16.999 -11.405 25.528 1.00 ^458 

17.675 -12.253 26.503 1.00 30 25 

16.898 -13.569 26.662 1.00 33 32 

17.232 -14.358 27.915 1.00 38 17 

16.135 -15.367 28.260 1.00 37.27 

15.646 -16.085 27.086 1 00 43 92 

14.433 -15.923 26.557 1.00 46.68 

13.578 -15.061 27.097 1.00 45 59 

14.074 -16.620 25.486 1.00 46 25 

19.110 -12.531 26.048 1 00 7982 

20.057 -12.397 26.823 1.00 98 76 

19.269 -12.921 24.789 0.50 30 27 

19.252 -12.906 24.781 0.50 31 41 

20.591 -13.212 24.253 0.50 31 98 

20.547 -13.206 24.183 0.50 33 77 

20.489 -13.646 22.788 0.50 31 34 

20.348 -13.595 22.714 0.50 35 88 

20.179 -15.127 22.592 0.50 33.62 

21.605 -13.594 21.861 0.50 40 47 

20.354 -16.099 24.109 0.50 35 21 

21.247 -13.937 20.115 0.50 4679 

22.155 -16.194 24.259 0.50 3320 

21.837 -15.632 19.976 0.50 43 22 

21.498 -11.993 24.366 0.50 33 33 

21.487 -12.005 24.289 0.50 34 45 

22.702 -12.123 24.594 0.50 33 54 

22.699 -12.162 24.438 0.50 34 43 

20.913 -10.809 24.215 1.00 3'> 07 

21.674 -9.560 24.298 1.00 32 48 

20.930 -8.437 23.578 1.00 2974 

21.161 -8.364 22.093 1.00 36 73 

20.425 -6.849 21.462 1.00 38 21 

21.693 -5.657 21.943 1.00 35 91 

21.877 -9.122 25.738 1.00 78 81 

22.686 -8.240 26.013 1.00 30 13 

21.120 -9.72] 26.646 1.00 ?7 14 

21.199 -9.359 28.038 1.00 27 34 

22.592 -9.524 28,598 1.00 34^85 

22.624 -9.480 30.080 1.00 38 58 

21.584 -9.620 30.724 1.00 4799 

23.801 -9.260 30.666 1.00 47 14 
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ATOM 3012 C 

ATOM 3013 O 

ATOM 3014 N 

ATOM 3015 CA 

5 ATOM 3016 CB 

ATOM 3017 CG 

ATOM 3018 CDl 

ATOM 3019 CD2 

ATOM 3020 C 

10 ATOM 3021 O 

ATOM 3022 N 

ATOM 7023 CA 

ATOM 3024 CB 

ATOM 3025 CG 

15 ATOM 3026 CD 

ATOM 3027 OEl 

ATOM 3028 NE2 

ATOM 3029 C 

ATOM 3030 O 

20 ATOM 3031 N 

ATOM 3032 CA 

ATOM 3033 C 

ATOM 3034 O 

ATOM 3035 N 

25 ATOM 3036 CA 

ATOM 3037 CB 

ATOM 3038 CG 

ATOM 3039 CD 

ATOM 3040 OEl 

30 ATOM 3041 0E2 

ATOM 3042 C 

ATOM 3043 O 

ATOM 3044 N 

ATOM 3045 CA 

35 ATOM 3046 CB 

ATOM 3047 CG 

ATOM 3048 CD 

ATOM 3049 OEl 

ATOM 3050 OE2 

40 ATOM 3051 C 

ATOM 3052 O 

ATOM 3053 N 

ATOM 3054 CA 

ATOM 3055 CB 
45 ATOM 3056 CG 
ATOM 3057 CDl 
ATOM 3058 CD2 
ATOM 3059 CEl 



ASN B 
ASN B 
LEU B 
LEU B 
LEU B 
LEU B 
LEU B 
LEU B 
LEU B 
LEU B 
GLN B 
GLN B 
GLN B 
GLN B 
GLN B 
GLN B 
GLN B 
GLN B 
GLN B 
GLY B 
GLY B 
GLY B 
GLY B 
GLU B 
GLU B 
GLU B 
GLU B 
GLU B 
GLU B 
GLU B 
GLU B 
GLU B 
GLU B 
GLU B 
GLU B 
GLU B 
GLU B 
GLU B 
GLU B 
GLU B 
GLU B 
PHE B 
PHE B 
PHE B 
PHE B 
PHE B 
PHE B 
PHE B 



439 

439 

440 

440 

440 

440 

440 

440 

440 

440 

441 

441 

441 

441 

441 

441 

441 

441 

441 

442 

442 

442 

442 

443 

443 

443 

443 

443 

443 

443 

443 

443 

444 

444 

444 

444 

444 

444 

444 

444 

444 

445 

445 

445 

445 

445 

445 

445 



20.745 

21.396 

19.625 

19.061 

17.761 

17.087 

17.958 

15.704 

18.782 

18.131 

19.268 

19.060 

20.250 

21.572 

21.610 

21.539 

21.703 

17.789 

17.303 

1 7.266 

16.058 

16.217 

15.290 

17.387 

17.635 

1 8.960 

19.005 

19.701 

19.343 

20.607 

17.662 

17.265 

18.128 

18.182 

1 9.046 

20.545 

21.340 

20.817 

22.488 

16.758 

16.377 

15.987 

14.600 

13.989 

12.483 

11.746 

11.802 

10.346 




-7.903 
-7.106 
-7.564 
-6.214 
-6.157 
-4.786 
-3.843 
-4.914 
-5.785 
.6.504 
-4.609 
-4.099 
-3.231 
-3.956 
-5.028 
-4.772 
-6.288 
-3.265 
-2.768 
-3.105 
-2.327 
-0.873 
-0.279 
-0.293 
1.102 
1.590 
1.525 
0.270 
-0.841 
0.394 
1.278 
2.328 
0.253 
0.302 
-0.834 
-0.617 
-1.869 
-2.978 
-1.746 
0.155 
0.822 
-0.730 
-0.969 
-2.067 
-2.055 
-2.386 
- 1 .694 
-2.359 



28.212 
28.891 
27.573 
27.633 
26.818 
26.740 
25.923 
26.111 
29.074 
29.830 
29.452 
30.807 
3 1 .234 
3 1 .228 
32.279 
33.473 
3 1 .823 
30.883 
29.866 
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Appendix 2 

Atomic Coordinates for Human ERa Complexed With OHT 
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ATOM 612 O TRP 383 

ATOM 613 N LEU 384 

ATOM 614 CA LEU 384 

ATOM 615 CB LEU 384 

5 ATOM 616 CG LEU 384 

ATOM 617 CDl LEU 384 

ATOM 618 CD2 LEU 384 

ATOM 619 C LEU 384 

ATOM 620 O LEU 384 
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ATOM 622 CA GLU 385 
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ATOM 629 O GLU 385 
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ATOM 661 O ILE 389 

ATOM 662 N GLY 390 

ATOM 663 CA GLY 390 
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ATOM 673 O LEU 391 
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ATOM 700 NE ARG 394 

ATOM 701 CZ ARG 394 
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ATOM 703 NH2 ARG 394 

45 ATOM 704 C ARG 394 

ATOM 705 O ARG 394 

ATOM 706 N SER 395 

ATOM 707 CA SER 395 
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ATOM 1331 CEl HIS 474 









24.002 


19.427 


-0.545 


1.00 


44.47 


25.438 


19.874 


-0.849 


1.00 


46.70 


25.514 


20.477 


-2.246 


1.00 


46.70 


25.890 


20.883 


0.199 


1.00 


47.32 


22.423 


17.996 


0.786 


1.00 


39.06 


21.856 


18.505 


1.760 


1.00 


34.97 


21.814 


17.151 


-0.046 


1.00 


35.46 


20.418 


16.768 


0.145 


1.00 


34.38 


19.914 


15.963 


-1.052 


1.00 


38.02 


19.772 


16.773 


-2.329 


1.00 


42.67 


19.339 


15.923 


-3.509 


1.00 


48.30 


19.671 


14.716 


-3.538 


1.00 


50.53 


18 666 


16.463 


-4.412 


1.00 


51.06 


^0 290 


15.916 


1.403 


1.00 


34.37 


19 321 


16.035 


2.157 


1.00 


32.60 




15.046 


1.606 


1.00 


34.66 


'^l 309 


14.162 


2.766 


1.00 


35.68 


^'^ 515 


13.222 


2.671 


1.00 


34.57 


'^7 376 


12.122 


1.614 


1.00 


37.98 


■^1 476 


10.989 


2.063 


1.00 


39.79 


20 268 


1 1.027 


1.743 


1.00 


41.12 


21 974 


10.061 


2.737 


1.00 


32.11 


'^1 393 


14.983 


4.052 


1.00 


34.79 


20 596 


14.793 


4.969 


1.00 


32.80 


99 358 


15.898 


4.112 


1.00 


33.93 


^^2 518 


16.739 


5.291 


1.00 


35.58 


'?3 683 


17.710 


5.097 


1.00 


39.11 


25 050 


17.050 


5.138 


1.00 


41.47 


26 080 


17.957 


5.794 


1.00 


46.97 


'^l 445 


17.286 


5.862 


1.00 


48.40 


27 850 


16.702 


4.547 


1.00 


51.55 


'>1.237 


17.523 


5.582 


1.00 


34.78 


20-795 


1 7.607 


6.724 


1.00 


33.95 


20.643 


1 8.097 


4.545 


1.00 


33.47 


19.420 


18.865 


4.720 


1.00 


34.63 


18.923 


1 9.404 


3.380 


1.00 


37.21 


17.654 


20.221 


3.522 


1.00 


43.24 


16.559 


19.687 


3.230 


1.00 


45.20 


17.750 


21.396 


3.932 


1.00 


45.59 


18.339 


17.998 


5.338 


1.00 


32.93 


17.642 


18.416 


6.264 


1.00 


32.87 


18.199 


16.784 


4.827 


1.00 


32.74 


17.185 


15.882 


5.343 


1.00 


32.21 


17.185 


14.575 


4.568 


1.00 


32.79 


16.047 


13.675 


4.924 


1.00 


36.22 


14.711 


13.813 


4.750 


1.00 




16.227 


12.456 


5.542 


1.00 


38.97 


15.053 


1 1.883 


5.732 


1.00 


37.99

ATOM   1332  NE2 HIS 474  14.116  12.686   5.261  1.00 37.43
ATOM   1333  C   HIS 474  17.403  15.573   6.815  1.00 29.74
ATOM   1334  O   HIS 474  16.460  15.543   7.596  1.00 29.90
ATOM   1335  N   ILE 475  18.653  15.326   7.185  1.00 27.80
ATOM   1336  CA  ILE 475  18.971  15.014   8.571  1.00 25.61
ATOM   1337  CB  ILE 475  20.478  14.708   8.720  1.00 25.59
ATOM   1338  CG2 ILE 475  20.877  14.713  10.193  1.00 27.17
ATOM   1339  CG1 ILE 475  20.787  13.341   8.092  1.00 26.17
ATOM   1340  CD1 ILE 475  22.258  13.071   7.849  1.00 27.07
ATOM   1341  C   ILE 475  18.576  16.201   9.460  1.00 27.91
ATOM   1342  O   ILE 475  17.928  16.038  10.485  1.00 29.16
ATOM   1343  N   HIS 476  18.956  17.404   9.054  1.00 29.41
ATOM   1344  CA  HIS 476  18.621  18.575   9.846  1.00 29.73
ATOM   1345  CB  HIS 476  19.342  19.796   9.281  1.00 32.27
ATOM   1346  CG  HIS 476  20.777  19.867   9.699  1.00 39.44
ATOM   1347  CD2 HIS 476  21.355  19.707  10.915  1.00 39.81
ATOM   1348  ND1 HIS 476  21.809  20.067   8.808  1.00 39.79
ATOM   1349  CE1 HIS 476  22.959  20.027   9.456  1.00 39.98
ATOM   1350  NE2 HIS 476  22.712  19.809  10.735  1.00 40.26
ATOM   1351  C   HIS 476  17.120  18.810   9.948  1.00 31.40
ATOM   1352  O   HIS 476  16.636  19.336  10.951  1.00 29.79
ATOM   1353  N   ARG 477  16.374  18.396   8.929  1.00 31.82
ATOM   1354  CA  ARG 477  14.929  18.570   8.956  1.00 31.53
ATOM   1355  CB  ARG 477  14.343  18.376   7.557  1.00 34.95
ATOM   1356  CG  ARG 477  14.425  19.627   6.700  1.00 40.46
ATOM   1357  CD  ARG 477  13.698  19.445   5.370  1.00 45.22
ATOM   1358  NE  ARG 477  14.107  20.456   4.399  1.00 53.05
ATOM   1359  CZ  ARG 477  13.647  21.705   4.376  1.00 55.89
ATOM   1360  NH1 ARG 477  12.756  22.106   5.274  1.00 56.17
ATOM   1361  NH2 ARG 477  14.084  22.558   3.457  1.00 59.49
ATOM   1362  C   ARG 477  14.310  17.582   9.931  1.00 30.70
ATOM   1363  O   ARG 477  13.360  17.903  10.649  1.00 30.24
ATOM   1364  N   VAL 478  14.863  16.375   9.972  1.00 29.67
ATOM   1365  CA  VAL 478  14.351  15.369  10.887  1.00 29.68
ATOM   1366  CB  VAL 478  14.937  13.975  10.575  1.00 32.01
ATOM   1367  CG1 VAL 478  14.461  12.973  11.609  1.00 32.93
ATOM   1368  CG2 VAL 478  14.506  13.528   9.169  1.00 31.00
ATOM   1369  C   VAL 478  14.696  15.774  12.316  1.00 29.81
ATOM   1370  O   VAL 478  13.860  15.677  13.220  1.00 30.25
ATOM   1371  N   LEU 479  15.929  16.232  12.516  1.00 28.81
ATOM   1372  CA  LEU 479  16.360  16.674  13.836  1.00 28.74
ATOM   1373  CB  LEU 479  17.799  17.210  13.779  1.00 26.65
ATOM   1374  CG  LEU 479  18.910  16.152  13.853  1.00 26.05
ATOM   1375  CD1 LEU 479  20.231  16.772  13.395  1.00 25.81
ATOM   1376  CD2 LEU 479  19.028  15.603  15.277  1.00 25.34
ATOM   1377  C   LEU 479  15.411  17.777  14.313  1.00 29.54
ATOM   1378  O   LEU 479  14.997  17.786  15.472  1.00 29.00
ATOM   1379  N   ASP 480  15.076  18.703  13.415  1.00 31.52
ATOM   1380  CA  ASP 480  14.162  19.800  13.741  1.00 33.84
ATOM   1381  CB  ASP 480  13.943  20.712  12.528  1.00 34.37
ATOM   1382  CG  ASP 480  15.055  21.743  12.345  1.00 36.26
ATOM   1383  OD1 ASP 480  15.119  22.354  11.257  1.00 36.56
ATOM   1384  OD2 ASP 480  15.860  21.951  13.274  1.00 34.19
ATOM   1385  C   ASP 480  12.818  19.222  14.174  1.00 33.48
ATOM   1386  O   ASP 480  12.186  19.724  15.105  1.00 33.89
ATOM   1387  N   LYS 481  12.379  18.161  13.498  1.00 33.90
ATOM   1388  CA  LYS 481  11.106  17.536  13.839  1.00 32.97
ATOM   1389  CB  LYS 481  10.719  16.489  12.784  1.00 34.66
ATOM   1390  C   LYS 481  11.164  16.895  15.225  1.00 33.57
ATOM   1391  O   LYS 481  10.167  16.869  15.943  1.00 35.37
ATOM   1392  N   ILE 482  12.328  16.377  15.607  1.00 32.71
ATOM   1393  CA  ILE 482  12.457  15.764  16.922  1.00 31.60
ATOM   1394  CB  ILE 482  13.743  14.913  17.028  1.00 32.65
ATOM   1395  CG2 ILE 482  13.877  14.338  18.430  1.00 32.50
ATOM   1396  CG1 ILE 482  13.697  13.785  15.995  1.00 32.72
ATOM   1397  CD1 ILE 482  14.978  12.969  15.908  1.00 [illegible]
ATOM   1398  C   ILE 482  12.456  16.853  17.994  1.00 31.69
ATOM   1399  O   ILE 482  11.946  16.649  19.097  1.00 29.98
ATOM   1400  N   THR 483  13.027  18.012  17.679  1.00 31.33
ATOM   1401  CA  THR 483  13.022  19.109  18.644  1.00 31.71
ATOM   1402  CB  THR 483  13.756  20.351  18.109  1.00 32.92
ATOM   1403  OG1 THR 483  15.111  20.012  17.788  1.00 29.99
ATOM   1404  CG2 THR 483  13.756  21.452  19.160  1.00 30.47
ATOM   1405  C   THR 483  11.559  19.483  18.920  1.00 32.85
ATOM   1406  O   THR 483  11.146  19.598  20.070  1.00 31.83
ATOM   1407  N   ASP 484  10.785  19.656  17.851  1.00 31.91
ATOM   1408  CA  ASP 484   9.369  20.003  17.965  1.00 34.15
ATOM   1409  CB  ASP 484   8.708  20.013  16.591  1.00 37.41
ATOM   1410  CG  ASP 484   9.270  21.080  15.680  1.00 42.02
ATOM   1411  OD1 ASP 484   9.871  22.045  16.198  1.00 43.26
ATOM   1412  OD2 ASP 484   9.106  20.952  14.445  1.00 42.49
ATOM   1413  C   ASP 484   8.657  18.985  18.840  1.00 33.16
ATOM   1414  O   ASP 484   7.830  19.339  19.676  1.00 34.86
ATOM   1415  N   THR 485   8.996  17.715  18.646  1.00 33.91
ATOM   1416  CA  THR 485   8.396  16.635  19.414  1.00 34.41
ATOM   1417  CB  THR 485   8.875  15.268  18.885  1.00 33.58
ATOM   1418  OG1 THR 485   8.400  15.094  17.542  1.00 37.04
ATOM   1419  CG2 THR 485   8.347  14.138  19.751  1.00 30.89
ATOM   1420  C   THR 485   8.708  16.757  20.903  1.00 35.15
ATOM   1421  O   THR 485   7.818  16.600  21.744  1.00 31.99
ATOM   1422  N   LEU 486   9.966  17.046  21.229  1.00 33.77
ATOM   1423  CA  LEU 486  10.368  17.192  22.621  1.00 34.31
ATOM   1424  CB  LEU 486  11.879  17.448  22.721  1.00 32.00
ATOM   1425  CG  LEU 486  12.776  16.201  22.754  1.00 34.99
ATOM   1426  CD1 LEU 486  14.233  16.613  22.521  1.00 32.65
ATOM   1427  CD2 LEU 486  12.635  15.481  24.105  1.00 29.90
ATOM   1428  C   LEU 486   9.597  18.348  23.256  1.00 34.87
ATOM   1429  O   LEU 486   9.078  18.225  24.362  1.00 35.85
ATOM   1430  N   ILE 487   9.513  19.469  22.548  1.00 35.59
ATOM   1431  CA  ILE 487   8.787  20.625  23.064  1.00 36.79
ATOM   1432  CB  ILE 487   8.890  21.826  22.095  1.00 37.32
ATOM   1433  CG2 ILE 487   7.833  22.884  22.443  1.00 40.19
ATOM   1434  CG1 ILE 487  10.292  22.443  22.181  1.00 36.00
ATOM   1435  CD1 ILE 487  10.635  23.041  23.544  1.00 33.58
ATOM   1436  C   ILE 487   7.315  20.257  23.276  1.00 38.56
ATOM   1437  O   ILE 487   6.708  20.628  24.282  1.00 38.52
ATOM   1438  N   HIS 488   6.749  19.521  22.326  1.00 40.33
ATOM   1439  CA  HIS 488   5.357  19.096  22.427  1.00 42.29
ATOM   1440  CB  HIS 488   4.962  18.282  21.197  1.00 44.26
ATOM   1441  CG  HIS 488   3.612  17.647  21.305  1.00 47.75
ATOM   1442  CD2 HIS 488   2.369  18.175  21.214  1.00 47.46
ATOM   1443  ND1 HIS 488   3.440  16.298  21.534  1.00 51.09
ATOM   1444  CE1 HIS 488   2.148  16.023  21.577  1.00 51.15
ATOM   1445  NE2 HIS 488   1.477  17.144  21.385  1.00 50.22
ATOM   1446  C   HIS 488   5.154  18.254  23.685  1.00 42.55
ATOM   1447  O   HIS 488   4.233  18.498  24.467  1.00 43.02
ATOM   1448  N   LEU 489   6.022  17.266  23.879  1.00 39.91
ATOM   1449  CA  LEU 489   5.936  16.399  25.048  1.00 39.93
ATOM   1450  CB  LEU 489   7.087  15.396  25.048  1.00 38.83
ATOM   1451  CG  LEU 489   6.961  14.242  24.056  1.00 39.31
ATOM   1452  CD1 LEU 489   8.259  13.456  24.027  1.00 39.01
ATOM   1453  CD2 LEU 489   5.799  13.345  24.459  1.00 41.98
ATOM   1454  C   LEU 489   5.973  17.203  26.339  1.00 40.24
ATOM   1455  O   LEU 489   5.267  16.888  27.298  1.00 38.72
ATOM   1456  N   MET 490   6.798  18.246  26.353  1.00 39.94
ATOM   1457  CA  MET 490   6.939  19.102  27.522  1.00 41.50
ATOM   1458  CB  MET 490   8.208  19.953  27.394  1.00 39.15
ATOM   1459  CG  MET 490   9.495  19.169  27.608  1.00 41.69
ATOM   1460  SD  MET 490  10.978  20.106  27.161  1.00 35.76
ATOM   1461  CE  MET 490  12.178  18.775  27.056  1.00 39.22
ATOM   1462  C   MET 490   5.718  20.004  27.717  1.00 42.33
ATOM   1463  O   MET 490   5.296  20.258  28.848  1.00 41.09
ATOM   1464  N   ALA 491   5.162  20.498  26.616  1.00 43.15
ATOM   1465  CA  ALA 491   3.983  21.351  26.693  1.00 43.79
ATOM   1466  CB  ALA 491   3.622  21.879  25.311  1.00 43.93
ATOM   1467  C   ALA 491   2.841  20.510  27.251  1.00 46.16
ATOM   1468  O   ALA 491   2.073  20.967  28.095  1.00 44.69
ATOM   1469  N   LYS 492   2.752  19.268  26.783  1.00 46.29
ATOM   1470  CA  LYS 492   1.711  18.351  27.222  1.00 49.90
ATOM   1471  CB  LYS 492   1.772  17.053  26.411  1.00 50.03
ATOM   1472  CG  LYS 492   1.087  17.135  25.062  1.00 53.81
ATOM   1473  CD  LYS 492  -0.002  16.084  24.930  1.00 59.00
ATOM   1474  CE  LYS 492  -0.988  16.453  23.827  1.00 61.85
ATOM   1475  NZ  LYS 492  -1.351  15.281  22.976  1.00 62.89
ATOM   1476  C   LYS 492   1.841  18.025  28.701  1.00 51.15
ATOM   1477  O   LYS 492   0.845  17.784  29.379  1.00 53.37
ATOM   1478  N   ALA 493   3.072  18.012  29.199  1.00 50.15
ATOM   1479  CA  ALA 493   3.321  17.706  30.600  1.00 49.17
ATOM   1480
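
The coordinate records in this listing may be read programmatically before being supplied to a modeling system. The following Python sketch is offered only as an illustration and is not software described in this disclosure. It assumes that each complete record is whitespace-delimited in the order shown in the listing (record tag, atom serial number, atom name, residue name, residue number, x, y, z, occupancy, temperature factor); the names `Atom`, `parse_atom_rows` and `SAMPLE` are hypothetical. Incomplete records, such as the truncated record 1480, are skipped rather than guessed.

```python
from typing import List, NamedTuple


class Atom(NamedTuple):
    serial: int
    name: str
    res_name: str
    res_seq: int
    x: float
    y: float
    z: float
    occupancy: float
    b_factor: float


def parse_atom_rows(text: str) -> List[Atom]:
    """Parse whitespace-delimited ATOM rows, skipping incomplete ones."""
    atoms = []
    for line in text.splitlines():
        fields = line.split()
        # A complete row is the ATOM tag followed by nine data fields.
        if len(fields) != 10 or fields[0] != "ATOM":
            continue
        serial, name, res_name, res_seq = fields[1:5]
        x, y, z, occ, b = (float(v) for v in fields[5:])
        atoms.append(
            Atom(int(serial), name, res_name, int(res_seq), x, y, z, occ, b)
        )
    return atoms


# Records 1476 to 1479 of this listing, plus the truncated record 1480.
SAMPLE = """\
ATOM 1476 C  LYS 492 1.841 18.025 28.701 1.00 51.15
ATOM 1477 O  LYS 492 0.845 17.784 29.379 1.00 53.37
ATOM 1478 N  ALA 493 3.072 18.012 29.199 1.00 50.15
ATOM 1479 CA ALA 493 3.321 17.706 30.600 1.00 49.17
ATOM 1480
"""

atoms = parse_atom_rows(SAMPLE)
for atom in atoms:
    print(atom.serial, atom.name, atom.res_name, atom.res_seq,
          atom.x, atom.y, atom.z)
```

Standard Protein Data Bank files use fixed column positions rather than whitespace separation, so a fixed-column reader would be the safer choice for files obtained from that source.
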
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CLAIMS 

WHAT IS CLAIMED IS: 

1. A method of identifying a compound that modulates nuclear receptor activity, said method
comprising:

modeling test compounds that fit spatially into a nuclear receptor ligand binding domain of
interest using an atomic structural model of the estrogen receptor α ligand binding domain
or portion thereof,

screening said test compounds in an assay characterized by binding of a test compound to
the ligand binding domain, and

identifying a test compound that modulates nuclear receptor activity, wherein said atomic
structural model comprises atomic coordinates of amino acid residues corresponding to
residues of human estrogen receptor α Met343, Leu346, Ala350, Glu353, Leu384, Leu387,
Leu391, Arg394, Phe404, Met421, Leu428, Gly521, His524, Leu525 and Met528.

2. The method of Claim 1 wherein the amino acid residues correspond to residues Met343,
Met421, His524, Leu525 and Met528.

3. The method of Claim 1 wherein the test compound is an agonist and nuclear receptor
activity is measured by binding of a coactivator to the coactivator binding site.

4. The method of Claim 1 wherein the test compound is an antagonist and nuclear receptor
activity is measured by the unwinding of helix 12.

5. The method of Claim 1 wherein the test compound is an antagonist and nuclear receptor
activity is measured by the blocking of coactivator binding.

6. The method of Claim 1 wherein said screening is in vitro.

7. The method of Claim 6 wherein said screening is high throughput screening. 

8. The method of Claim 1 wherein said test compound is from a library of compounds.

9. The method of Claim 1 wherein said test compound is a small organic molecule, a peptide,
or peptidomimetic.

10. The method of Claim 1 which further comprises the step of providing the atomic
coordinates of the estrogen receptor α ligand binding domain or portion thereof to a
computerized modeling system, prior to said modeling step.
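
By way of illustration only, and not as a limitation of any claim, the notion of a test compound that fits spatially into a binding domain can be approximated by a simple distance test once the atomic coordinates have been supplied to a computer. The Python sketch below is not the modeling system of this disclosure. It assumes that a pose "fits" when no ligand atom approaches a pocket atom more closely than a clash distance and at least one ligand atom lies within a contact distance; the function names, the 2.5 Å and 4.5 Å thresholds and all coordinates shown are hypothetical.

```python
import math
from typing import Sequence, Tuple

Point = Tuple[float, float, float]


def closest_approach(ligand: Sequence[Point], pocket: Sequence[Point]) -> float:
    """Smallest ligand-to-pocket interatomic distance, in angstroms."""
    return min(math.dist(a, b) for a in ligand for b in pocket)


def fits_spatially(ligand: Sequence[Point], pocket: Sequence[Point],
                   clash: float = 2.5, contact: float = 4.5) -> bool:
    """Crude fit test: no steric clash, and at least one close contact.

    Both thresholds are illustrative assumptions, not values from the
    disclosure.
    """
    nearest = closest_approach(ligand, pocket)
    return clash <= nearest <= contact


# Hypothetical pocket and ligand coordinates, for demonstration only.
pocket_atoms = [(0.0, 0.0, 0.0), (6.0, 0.0, 0.0)]
good_pose = [(3.0, 0.0, 0.0)]       # 3.0 angstroms from both pocket atoms
clashing_pose = [(0.5, 0.0, 0.0)]   # overlaps the first pocket atom
distant_pose = [(3.0, 20.0, 0.0)]   # outside the pocket entirely

for label, pose in [("good", good_pose), ("clash", clashing_pose),
                    ("distant", distant_pose)]:
    print(label, fits_spatially(pose, pocket_atoms))
```

A practical docking program would score many poses with a physical energy function; the distance test above only conveys the geometric idea.
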

11. The method of Claim 1 wherein said nuclear receptor is selected from the group consisting
of estrogen receptors, thyroid receptors, retinoid receptors, glucocorticoid receptors,
progestin receptors, mineralocorticoid receptors, androgen receptors, peroxisome receptors
and vitamin D receptors.

12. The method of Claim 11 wherein said nuclear receptor is an estrogen receptor.

13. The method of Claim 12 wherein said estrogen receptor is the estrogen receptor α.

14. A method of identifying a compound that modulates ligand binding to a nuclear receptor,
said method comprising:

modeling test compounds that fit spatially into a nuclear receptor ligand binding domain of
interest using an atomic structural model of the estrogen receptor α ligand binding domain
or portion thereof,

screening said test compounds in an assay characterized by binding of a test compound to
the binding domain, and

identifying a test compound that modulates ligand binding to said nuclear receptor,
wherein said atomic structural model comprises atomic coordinates of amino acid residues
corresponding to residues of human estrogen receptor α Met343, Leu346, Ala350, Glu353,
Leu384, Leu387, Leu391, Arg394, Phe404, Met421, Leu428, Gly521, His524, Leu525 and
Met528.

15. The method of Claim 14 wherein the amino acid residues correspond to residues Met343, 
Met421, His524, Leu525 and Met528. 

16. The method of Claim 14 wherein said nuclear receptor is selected from the group
consisting of estrogen receptors, thyroid receptors, retinoid receptors, glucocorticoid
receptors, progestin receptors, mineralocorticoid receptors, androgen receptors,
peroxisome receptors and vitamin D receptors.

17. The method of Claim 16 wherein said nuclear receptor is an estrogen receptor. 

18. The method of Claim 17 wherein said estrogen receptor is the estrogen receptor α.

19. The method of Claim 14 wherein said screening is in vitro.

20. The method of Claim 19 wherein said screening is high throughput screening. 

21. The method of Claim 14 wherein said test compound is from a library of compounds. 

22. The method of Claim 14 wherein said test compound is an agonist or antagonist of ligand 
binding. 

23. The method of Claim 14 wherein said test compound is a small organic molecule, a
peptide, or peptidomimetic.

24. A method for identifying an agonist or antagonist of ligand binding to a nuclear receptor,
said method comprising the steps of:

providing the atomic coordinates of the estrogen receptor α ligand binding domain or
portion thereof to a computerized modeling system, wherein said atomic coordinates are of
the amino acid residues corresponding to residues of human estrogen receptor α Met343,
Leu346, Ala350, Glu353, Leu384, Leu387, Leu391, Arg394, Phe404, Met421, Leu428,
Gly521, His524, Leu525 and Met528;

modeling compounds which fit spatially into the ligand binding domain; and

identifying in an assay for nuclear receptor activity a compound which increases or
decreases the activity of the nuclear receptor by binding the ligand binding domain of said
nuclear receptor, whereby an agonist or antagonist of ligand binding is identified.

25. The method of Claim 24 wherein the amino acid residues correspond to residues Met343,
Met421, His524, Leu525 and Met528.

26. The method of Claim 24 wherein said nuclear receptor is selected from the group
consisting of estrogen receptors, thyroid receptors, retinoid receptors, glucocorticoid
receptors, progestin receptors, mineralocorticoid receptors, androgen receptors,
peroxisome receptors and vitamin D receptors.

27. The method of Claim 26 wherein said nuclear receptor is an estrogen receptor. 

28. The method of Claim 27 wherein said estrogen receptor is the estrogen receptor α.

29. A method of modulating nuclear receptor activity in a mammal by administering to a
mammal in need thereof a sufficient amount of a compound that fits spatially and
preferentially into a ligand binding domain of a nuclear receptor of interest, where said
compound is designed so as to distort at least one amino acid residue corresponding to
residues of human estrogen receptor α Met343, Leu346, Ala350, Glu353, Leu384, Leu387,
Leu391, Arg394, Phe404, Met421, Leu428, Gly521, His524, Leu525 and Met528.

30. The method of Claim 29 wherein at least one amino acid residue corresponds to residues
Met343, Met421, His524, Leu525 and Met528.

31. The method of Claim 29 wherein said nuclear receptor is selected from the group
consisting of estrogen receptors, thyroid receptors, retinoid receptors, glucocorticoid
receptors, progestin receptors, mineralocorticoid receptors, androgen receptors,
peroxisome receptors and vitamin D receptors.

32. The method of Claim 31 wherein said nuclear receptor is an estrogen receptor.

33. The method of Claim 32 wherein said estrogen receptor is the estrogen receptor α.

34. A machine-readable data storage medium, comprising a data storage material encoded with
machine readable data which, when using a machine programmed with instructions for
using said data, is capable of displaying a graphical three-dimensional representation of a
molecular complex of a compound bound to a nuclear receptor ligand binding domain
comprising structure coordinates of amino acids corresponding to human estrogen receptor
α Met343, Leu346, Ala350, Glu353, Leu384, Leu387, Leu391, Arg394, Phe404, Met421,
Leu428, Gly521, His524, Leu525 and Met528 or a homologue of said molecular complex,
wherein said homologue comprises a ligand binding domain that has a root mean square
deviation from the backbone atoms of said amino acids of not more than 1.5 Å.

35. The machine-readable data storage medium of Claim 34 wherein the amino acid residues 
correspond to residues Met343, Met421, His524, Leu525 and Met528. 

36. The machine-readable data storage medium of Claim 34 wherein said nuclear receptor is
selected from the group consisting of estrogen receptors, thyroid receptors, retinoid
receptors, glucocorticoid receptors, progestin receptors, mineralocorticoid receptors,
androgen receptors, peroxisome receptors and vitamin D receptors.

37. The machine-readable data storage medium of Claim 36 wherein said nuclear receptor is 
an estrogen receptor. 

38. The machine-readable data storage medium of Claim 37 wherein said estrogen receptor is
the estrogen receptor α.

39. The machine-readable data storage medium of Claim 34 wherein said molecular complex
is defined by the set of structure coordinates depicted in Appendix 1 or Appendix 2, or a
homologue of said molecular complex, said homologue having a root mean square
deviation from the backbone atoms of said amino acids of not more than 1.5 Å.
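
For illustration only, the root mean square deviation criterion recited for a homologue can be evaluated as sketched below in Python. The sketch assumes that the two sets of backbone atoms have already been superimposed and are listed in the same order; it does not perform the superposition itself. The function name `backbone_rmsd`, the 1.0 Å displacement used to build the comparison set, and the variable names are hypothetical. The reference coordinates are the backbone atoms N, CA, C and O of Thr485 taken from the coordinate listing preceding the claims.

```python
import math
from typing import Sequence, Tuple

Point = Tuple[float, float, float]


def backbone_rmsd(reference: Sequence[Point], model: Sequence[Point]) -> float:
    """RMSD in angstroms between two pre-superimposed, equally ordered atom sets."""
    if len(reference) != len(model) or not reference:
        raise ValueError("atom sets must be non-empty and of equal length")
    total = 0.0
    for p, q in zip(reference, model):
        total += sum((a - b) ** 2 for a, b in zip(p, q))
    return math.sqrt(total / len(reference))


# Backbone atoms N, CA, C, O of Thr485 from the coordinate listing.
reference = [
    (8.996, 17.715, 18.646),
    (8.396, 16.635, 19.414),
    (8.708, 16.757, 20.903),
    (7.818, 16.600, 21.744),
]
# Hypothetical comparison set: every atom displaced by 1.0 angstrom along x.
candidate = [(x + 1.0, y, z) for x, y, z in reference]

value = backbone_rmsd(reference, candidate)
within_limit = value <= 1.5  # the 1.5 angstrom limit recited above
print(round(value, 3), within_limit)
```

In practice the two structures would first be optimally superimposed, for example by a least-squares fit, before the deviation is computed.
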

40. A machine-readable data storage medium comprising a data storage material encoded with
a first set of machine readable data which, when combined with a second set of machine
readable data, using a machine programmed with instructions for using said first set of data
and said second set of data, can determine at least a portion of the structure coordinates
corresponding to the second set of machine readable data, wherein: said first set of data
comprises a Fourier transform of at least a portion of the structural coordinates selected
from the group consisting of coordinates depicted in Appendix 1 or Appendix 2; and said
second set of data comprises an X-ray diffraction pattern of a molecule or molecular
complex.
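
As a non-limiting illustration of what a Fourier transform of structure coordinates involves, the Python sketch below computes a structure factor F(hkl) as the sum over atoms of f · exp[2πi(hx + ky + lz)]. It is a simplified sketch under stated assumptions: coordinates are fractional (unit-cell) coordinates, each atom carries a constant scattering factor, and symmetry, temperature factors and resolution dependence are ignored. The function name `structure_factor` and the two-atom example are hypothetical and are not taken from Appendix 1 or Appendix 2.

```python
import cmath
import math
from typing import Sequence, Tuple

FractionalAtom = Tuple[float, Tuple[float, float, float]]


def structure_factor(hkl: Tuple[int, int, int],
                     atoms: Sequence[FractionalAtom]) -> complex:
    """Sum f * exp(2*pi*i*(h*x + k*y + l*z)) over all atoms.

    Each atom is (scattering factor, (x, y, z)) with fractional coordinates.
    """
    h, k, l = hkl
    total = 0j
    for f, (x, y, z) in atoms:
        total += f * cmath.exp(2j * math.pi * (h * x + k * y + l * z))
    return total


# Hypothetical two-atom cell: identical scatterers half a cell apart along a.
two_atoms = [(6.0, (0.0, 0.0, 0.0)), (6.0, (0.5, 0.0, 0.0))]

for hkl in [(0, 0, 0), (1, 0, 0), (2, 0, 0)]:
    F = structure_factor(hkl, two_atoms)
    print(hkl, round(abs(F), 6))
```

The two scatterers cancel for the (1, 0, 0) reflection and reinforce for (2, 0, 0), which is the kind of amplitude pattern that is compared against a measured diffraction pattern in molecular replacement.
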

41. The machine-readable data storage medium of Claim 40 wherein said nuclear receptor is
selected from the group consisting of estrogen receptors, thyroid receptors, retinoid
receptors, glucocorticoid receptors, progestin receptors, mineralocorticoid receptors,
androgen receptors, peroxisome receptors and vitamin D receptors.

42. The machine-readable data storage medium of Claim 41 wherein said nuclear receptor is 
an estrogen receptor. 

43. The machine-readable data storage medium of Claim 42 wherein said estrogen receptor is
the estrogen receptor α.

44. A cocrystal of a nuclear receptor comprising an agonist bound to the ligand binding
domain and a molecule bound to the coactivator binding site of the nuclear receptor,
wherein said crystal diffracts with at least 2.03 Å resolution.

45. The cocrystal of Claim 44 wherein said nuclear receptor is the estrogen receptor α.

46. The cocrystal of Claim 45 wherein said estrogen receptor α is human.

47. The cocrystal of Claim 46 wherein said molecule is a peptide.

48. The cocrystal of Claim 47 wherein said peptide comprises a NR-box amino acid sequence 
or derivative thereof. 

49. A cocrystal of a nuclear receptor comprising an antagonist bound to the ligand binding
domain of the nuclear receptor, wherein said crystal diffracts with at least 1.9 Å resolution.

50. The cocrystal of Claim 49 wherein said nuclear receptor is the estrogen receptor α.

51. The cocrystal of Claim 50 wherein said estrogen receptor α is human.

52. A computational method of designing a nuclear receptor ligand where at least one amino
acid residue of a nuclear receptor LBD that corresponds to human estrogen receptor α
Met343, Leu346, Ala350, Glu353, Leu384, Leu387, Leu391, Arg394, Phe404, Met421,
Leu428, Gly521, His524, Leu525 and Met528, interacts with at least one first chemical
moiety of said ligand, comprising the step of selecting at least one chemical modification
of said first chemical moiety to produce a second chemical moiety with a structure to either
decrease or increase an interaction between said interacting amino acid and said second
chemical moiety compared to said interaction between said interacting amino acid and said
first chemical moiety.

53. The method of Claim 52 wherein at least one amino acid residue corresponds to residues
Met343, Met421, His524, Leu525 and Met528.

54. The method of Claim 52 further comprising determining a change in interaction between 
said interacting amino acid and said ligand after chemical modification of said first 
chemical moiety. 

55. The method of Claim 52 wherein said chemical modification enhances hydrogen bonding
interaction, charge interaction, hydrophobic interaction, van der Waals interaction or
dipole interaction between said second chemical moiety and said interacting amino acid
compared to said first chemical moiety and said interacting amino acid.

56. The method of Claim 52 wherein said chemical modification reduces hydrogen bonding
interaction, charge interaction, hydrophobic interaction, van der Waals interaction or
dipole interaction between said second chemical moiety and said interacting amino acid
compared to said first chemical moiety and said interacting amino acid.
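
Purely as an illustration of how an increase or decrease in one of the recited interactions might be estimated computationally, the Python sketch below compares a van der Waals term before and after a modification using the Lennard-Jones 12-6 potential. This is a common textbook approximation and is not stated to be the method of this disclosure. The well depth `epsilon`, the contact parameter `sigma`, the distances and the function names are all hypothetical values chosen for demonstration.

```python
def lennard_jones(r: float, epsilon: float, sigma: float) -> float:
    """Lennard-Jones 12-6 energy at separation r (same length unit as sigma)."""
    ratio6 = (sigma / r) ** 6
    return 4.0 * epsilon * (ratio6 * ratio6 - ratio6)


def interaction_change(r_first: float, r_second: float,
                       epsilon: float = 0.15, sigma: float = 3.5) -> float:
    """Energy of the second moiety minus energy of the first moiety.

    A negative result indicates a more favorable (enhanced) interaction;
    a positive result indicates a less favorable (reduced) one.
    The default parameters are illustrative assumptions only.
    """
    return (lennard_jones(r_second, epsilon, sigma)
            - lennard_jones(r_first, epsilon, sigma))


# Hypothetical case: the modification moves the interacting atom from
# 5.0 angstroms to the potential minimum at 2**(1/6) * sigma.
r_min = 2 ** (1 / 6) * 3.5
delta = interaction_change(5.0, r_min)
print("enhanced" if delta < 0 else "reduced", round(delta, 4))
```

Hydrogen bonding, charge and dipole terms would each need their own energy expression; only the van der Waals term is sketched here.
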

57. The method of Claim 52 wherein said nuclear receptor is selected from the group
consisting of estrogen receptors, thyroid receptors, retinoid receptors, glucocorticoid
receptors, progestin receptors, mineralocorticoid receptors, androgen receptors,
peroxisome receptors and vitamin D receptors.

58. The method of Claim 57 wherein said nuclear receptor is an estrogen receptor.

59. The method of Claim 52 wherein the estrogen receptor is the estrogen receptor α.

60. The method of Claim 59 wherein the ligand is an agonist.

61. The method of Claim 60 wherein the ligand is selected from the group consisting of
17β-estradiol, diethylstilbestrol, moxestrol, mesohexestrol, coumestrol, Δ9-THC, o,p-DDT,
zearalenone and kepone.

62. The method of Claim 61 wherein the ligand is 17β-estradiol, and the first chemical moiety
is a free carbon of the A' ring located at a position selected from the group consisting of
C6α, C7α, C12α, C15α, C16α and C17α.

63. The method of Claim 59 wherein the ligand is an antagonist.

64. The method of Claim 63 wherein the ligand is selected from the group consisting of ICI
164,384 and EM800.

65. The method of Claim 59 wherein the ligand is a selective estrogen receptor modulator.

66. The method of Claim 65 wherein the ligand is selected from the group consisting of
tamoxifen, raloxifene and GW5638.

67. A method of modulating nuclear receptor activity in a mammal by administering to a
mammal in need thereof a sufficient amount of a ligand that fits spatially and preferentially
into a ligand binding domain of a nuclear receptor of interest, wherein said ligand is
designed by a computational method where at least one amino acid residue of a nuclear
receptor ligand binding domain that corresponds to human estrogen receptor α Met343,
Leu346, Ala350, Glu353, Leu384, Leu387, Leu391, Arg394, Phe404, Met421, Leu428,
Gly521, His524, Leu525 and Met528, interacts with at least one first chemical moiety of
said ligand, comprising the step of selecting at least one chemical modification of said first
chemical moiety to produce a second chemical moiety with a structure to either decrease or
increase an interaction between said interacting amino acid and said second chemical
moiety compared to said interaction between said interacting amino acid and said first
chemical moiety.

68. The method of Claim 67 wherein at least one amino acid residue corresponds to residues
Met343, Met421, His524, Leu525 and Met528.

69. The method of Claim 67 wherein said ligand is an antagonist.

70. The method of Claim 67 wherein said ligand is an agonist.

71. The method of Claim 70 which further comprises administering a coactivator mimic
designed by a computational method where at least one amino acid residue of a nuclear
receptor coactivator binding site that corresponds to human estrogen receptor α helix 3
residues Leu354, Val355, Met357, Ile358, Ala361 and Lys362, helix 4 residue Phe367,
helix 5 residues Gln375, Val376, Leu379 and Glu380, helix 6 residue Trp383, and helix 12
residues Asp538, Leu539, Glu542, Met543 and Leu544, interacts with at least one first
chemical moiety of said coactivator mimic, comprising the step of selecting at least one
chemical modification of said first chemical moiety to produce a second chemical moiety
with a structure to either decrease or increase an interaction between said interacting
amino acid and said second chemical moiety compared to said interaction between said
interacting amino acid and said first chemical moiety.